Pre-Flight Check

Benchmarked against: Anthropic - Effort (adaptive complexity)
SOP: Cloud UB ub-a0914abf17a7
Scope: All agents, before every non-trivial task

The Pre-Flight Check is a 30-second self-assessment run before any non-trivial task. It scores risk across three dimensions and determines whether to proceed, research first, or escalate.


3D scoring

Each dimension scores 0–2. Total score determines action.

Dimension 1: Blast Radius

Score  Meaning
0      Local only (one file, one config)
1      Module-level (multiple files, one service)
2      Fleet-wide (cross-ship, cross-agent, production)

Dimension 2: Knowledge Freshness

Score  Meaning
0      Timeless knowledge (design patterns, algorithms)
1      Perishable but recently verified
2      Perishable, not recently verified

Dimension 3: Reversibility

Score  Meaning
0      Fully reversible (git revert, config rollback)
1      Partially reversible (database migration, API change)
2      Irreversible (data deletion, external publishing)

Action thresholds

Total Score  Color      Action
0–2          🟢 Green   Proceed directly
3–4          🟡 Yellow  Search UB first, then proceed
5–6          🔴 Red     Search UB + intel_search + report to Captain before acting

Example

Task: Update LangGraph agent to use new checkpoint API

Blast Radius: 1 (module-level, affects agent pipeline)
Freshness: 2 (LangGraph API is Danger Zone, not recently verified)
Reversibility: 0 (can git revert)

Total: 1 + 2 + 0 = 3 → 🟡 Yellow → search UB first
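
The scoring and thresholds above can be sketched in a few lines of Python. This is only an illustration: the `PreFlight` class and its field names are assumptions made for this example, not part of any existing tooling. Only the 0–2 range per dimension and the 0–2 / 3–4 / 5–6 bands come from the tables.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PreFlight:
    """Hypothetical record of one Pre-Flight Check (names are illustrative)."""
    blast_radius: int   # 0 = local, 1 = module-level, 2 = fleet-wide
    freshness: int      # 0 = timeless, 1 = recently verified, 2 = not recently verified
    reversibility: int  # 0 = fully, 1 = partially, 2 = irreversible

    def __post_init__(self):
        # Reject anything outside the 0-2 range used by all three dimensions.
        for name in ("blast_radius", "freshness", "reversibility"):
            value = getattr(self, name)
            if value not in (0, 1, 2):
                raise ValueError(f"{name} must be 0, 1, or 2, got {value!r}")

    @property
    def total(self) -> int:
        return self.blast_radius + self.freshness + self.reversibility

    @property
    def action(self) -> tuple[str, str]:
        # Bands taken from the action thresholds table.
        if self.total <= 2:
            return ("Green", "Proceed directly")
        if self.total <= 4:
            return ("Yellow", "Search UB first, then proceed")
        return ("Red", "Search UB + intel_search + report to Captain before acting")


# The LangGraph example: 1 + 2 + 0 = 3
check = PreFlight(blast_radius=1, freshness=2, reversibility=0)
print(check.total, *check.action)  # 3 Yellow Search UB first, then proceed
```

Validating each dimension at construction time keeps an out-of-range score from silently landing in the wrong band.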

Integration points

  • Tech Freshness: Dimension 2 directly maps to the Danger Zone library list
  • HITL Boundary: Score 5–6 always requires Captain visibility (Constitution §5)
  • Work Orders: WO submission gates can require Pre-Flight Check record
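
As a rough sketch of how a Work Order gate might enforce these points, the function below refuses a submission that has no Pre-Flight Check record, or whose Red score was not reported to the Captain. The dictionary layout, the `preflight` and `captain_notified` keys, and the function name are all hypothetical; the actual WO schema is not described here.

```python
def wo_gate_allows(work_order: dict) -> tuple[bool, str]:
    """Return (allowed, reason) for a hypothetical Work Order submission gate."""
    record = work_order.get("preflight")  # assumed key for the Pre-Flight record
    if record is None:
        return (False, "missing Pre-Flight Check record")
    try:
        scores = [record[k] for k in ("blast_radius", "freshness", "reversibility")]
    except KeyError as exc:
        return (False, f"record lacks dimension {exc.args[0]}")
    if any(s not in (0, 1, 2) for s in scores):
        return (False, "each dimension must score 0, 1, or 2")
    # A total of 5-6 is Red: Captain visibility is required before acting.
    if sum(scores) >= 5 and not record.get("captain_notified", False):
        return (False, "Red score requires Captain visibility before acting")
    return (True, "ok")


# Illustrative work order: total 2 + 2 + 1 = 5 (Red), Captain not yet notified.
wo = {"id": "WO-example", "preflight": {"blast_radius": 2, "freshness": 2, "reversibility": 1}}
print(wo_gate_allows(wo))
```

Returning a reason alongside the boolean lets the gate tell the submitting agent what is missing rather than just rejecting the Work Order.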

When to skip

The only time you can skip the Pre-Flight Check is for truly trivial tasks:

  • Fixing a typo
  • Adding a log statement
  • Reading files for information

If you're writing code, changing config, or making any decision, run the check.