con un clic
verification-before-completion
// Use when about to claim work is complete, fixed, passing, verified, release-ready, or ready to commit, merge, publish, or hand off.
// Use when about to claim work is complete, fixed, passing, verified, release-ready, or ready to commit, merge, publish, or hand off.
Use when the user explicitly sets an Aegis goal with /aegis-goal, Aegis goal:, or asks to define goal, success evidence, stop condition, or task boundaries before work.
Use when defining new features, product behavior, UI/component design, architecture choices, contract changes, or ambiguous medium/high-complexity work before implementation.
Use when the user explicitly asks for first principles, first-principles review, Occam's razor, or when a complex decision has ambiguous goals, competing constraints, repeated fixes, fallback growth, duplicate owners, or architecture/product direction risk.
Use when the user asks to create, write, update, amend, supersede, or evaluate an ADR, architecture decision record, durable architecture decision, decision log, or baseline sync after architecture-changing work.
Use when explicitly requesting an independent code review, after subagent-driven implementation slices, before merging high-risk work, or when verification finds evidence, baseline, architecture, compatibility, or retirement uncertainty that needs reviewer scrutiny.
Use when starting a turn or checking Aegis skill routing.
| name | verification-before-completion |
| description | Use when about to claim work is complete, fixed, passing, verified, release-ready, or ready to commit, merge, publish, or hand off. |
→ About to claim "done", "passing", "fixed", "complete"? → Run the verification command first. Then claim.
Claiming work is complete without verification is dishonesty, not efficiency. Evidence before claims, always.
Before ANY success/completion claim, expression of satisfaction, commit, PR, task completion, or delegation. Applies to exact phrases, paraphrases, and implications.
Before any success claim, include the required evidence semantic slots. Natural prose, localized headings, or compact cards are all valid when the slots remain explicit and auditable.
Required evidence slots, one allowed card rendering:
- Evidence action / check performed:
- Result / exit status:
- Covered scope:
- Uncovered scope:
- Residual risk:
- Confidence grade: A | B | C
Semantic Slots:
Remove/Restore: side effects? temp instrumentation restored?
Evidence Bundle: exact command, scope, exit status, key output. State what's covered and what's not. Include target test and related regression evidence. When automation is blocked, provide reproducible manual verification steps.
Prompt Hygiene: when external output shaped judgment → state whether summaries or raw excerpts were used. Name large payloads not loaded. If summary insufficient → read back excerpt or lower claim. Include Evidence Used / Not Loaded / Next Evidence boundary when relevant.
Confidence: A (direct + regression, no unknowns) | B (direct, bounded risk) | C (partial only, not closed)
Authority: verified evidence ≠ authoritative completion. Keep distinct.
Goal Closure: when goal-framing or optional TaskIntentDraft goal
fields shaped the work, explicitly check the goal before claiming completion:
Goal Closure:
- Goal status: satisfied | blocked | needs-verification | scope-exceeded
- Success evidence:
- Stop state: done | blocked | needs-verification | scope-exceeded
- Non-goals respected:
Use done only when success evidence is satisfied. Use blocked when a
dependency, permission, or required fact is missing. Use needs-verification
when implementation exists but evidence is insufficient. Use
scope-exceeded when continuing would exceed the goal or non-goals.
Long-Task: re-read checkpoint, confirm every todo has status, no drift check unresolved.
Workspace Integrity: if the task created or modified a target project's
docs/aegis/ workspace and configured Aegis workspace support is available,
run
python <aegis-workspace-helper> bundle --root <target-project-root> --work YYYY-MM-DD-<slug>
when a work/ record exists, then run
python <aegis-workspace-helper> check --root <target-project-root> and
include the result in the evidence bundle. The generated proof bundle and
workspace check validate method-pack structure, index coverage, and
recognizable JSON artifact sidecars only; they do not judge evidence
sufficiency and do not grant completion authority.
Readiness Summary: for release, merge, handoff, or "ready?" requests, organize the evidence into a compact readiness view after the evidence slots:
Readiness Summary:
- Tests:
- Docs:
- Version:
- Host compatibility:
- Uncovered scope:
- Residual risk:
A readiness summary is advisory evidence organization only. It is not authorization to commit, tag, publish, merge, or release. It cannot provide completion authority.
Natural Aegis closeout: when Aegis skills materially shaped a
non-trivial task, use one sentence before the final claim to name the
boundary or quality risk Aegis held steady. Do not default to a visible Aegis Contribution Note: heading.
Keep it advisory method-pack discipline, not completion authority. Keep it
implicit for obvious fast-path replies unless the user asked about Aegis
routing.
This judgment used Aegis to hold one boundary steady: <boundary / quality risk>.
Natural expression may satisfy the visibility requirement when the semantic slots are still explicit. For example, "I will follow the Aegis order here: read the owner / baseline and current implementation first, add a failing example for the main path, then make the minimal repair and verify it" is a valid natural transition before implementation. Completion still needs fresh evidence and the applicable Governance Receipt fields.
Use structured trace only for audit, debug, release, long-task review, or user request. The structured form may name skills, stage transitions, quality effect, and boundary, but it should not replace the normal user-facing completion note.
User-Language Output: final response cards must localize user-facing section labels, field labels, and explanatory prose to the user's language. Keep commands, file paths, code identifiers, stable enum values, and exact product names unchanged. For important Aegis product terms, include the stable English identifier only when it prevents ambiguity, usually beside a user-language explanation on first use.
Complexity Delta: for non-trivial code changes, inspect the actual diff before claiming completion. This is a completion-time entropy check, not a universal failure gate. Skip or keep it one-line for tiny wording edits, tests-only additions, generated files, vendored files, fixtures, lockfiles, or purely mechanical formatting where no maintained source owner gained complexity.
Use the project language for field labels in the final response, but keep the internal shape recognizable:
Complexity Delta:
- Files over 800 lines:
- Files newly crossing 800 lines:
- Largest touched file delta:
- Largest touched function/block:
- New branches/fallbacks/adapters:
- Retired branches/fallbacks/adapters:
- Net entropy: decreased | stable | increased-with-justification
- Required follow-up:
When the delta finds meaningful pressure, add:
Complexity Governance Suggestion:
- Recommendation: none | monitor | schedule-refactor | extract helper | split owner | open follow-up
- Why:
- Suggested scope:
- Timing:
Use none for small owner-correct diffs, monitor for acceptable visible
growth, and stronger recommendations for 800+ line files, 80+ line blocks,
branch/fallback/adapter growth without retirement, or owner mismatch. The
suggestion is advisory; keep residual risk visible.
Rules:
GateDecision, and not completion
authority.Product / Requirement Baseline covers the accepted problem, success
evidence, non-goals, workflow constraints, and approved requirement/spec
intent. Architecture / Runtime Boundary Baseline covers canonical owner,
contract, source-of-truth, dependency direction, compatibility,
runtime-ready/method-pack boundary, and retirement state.
Triggering surfaces include architecture, contracts, source-of-truth owner, canonical owner, context/answering/runtime flow, cross-module data flow, producer-to-carrier-to-consumer chains, public user-visible identity, evidence model, retained fallback, adapter, compatibility path, requirement acceptance, product non-goals, and project-specific baseline rules.
Baseline Alignment:
- Trigger: yes | no
- Product / Requirement Baseline:
- Architecture / Runtime Boundary Baseline:
- Requirement / acceptance alignment:
- Architecture / owner / contract alignment:
- Result: aligned | Design Defect | Implementation Drift | missing-authority | needs-clarification
- scope: requirements | architecture | both
- Evidence:
- Residual risk:
Use Design Defect when the relevant requirement, design, or baseline is
wrong. Use Implementation Drift when the work deviates from a correct
unchanged baseline. Architecture Defect and Architecture Drift remain
compatibility aliases for architecture-scoped Design Defect and
architecture-scoped Implementation Drift.
When project instructions specifically require architecture reporting or the
completed work touched durable architecture surfaces, the architecture-scoped
subset may also be reported as Architecture Alignment:
Architecture Alignment:
- Trigger: yes | no
- Scope:
- Baseline checked:
- Result: aligned | Design Defect | Implementation Drift | missing-authority | needs-clarification
- Evidence:
- Integrity Residual Risk:
- Residual architecture risk:
Use Integrity Residual Risk when ArchitectureReviewRequired: yes, an
Architecture Integrity Lens shaped the plan or review, or the diff touches
canonical owner, source-of-truth, fallback, adapter, or duplicate-owner
surfaces. Name any unresolved responsibility overlap, missed higher-level
owner / contract fix, retained caller-side fallback, or stale path that still
needs retirement. If none remains, state none rather than expanding into a
new gate.
Trigger: no or skip the expanded block for simple
wording edits, ordinary README cleanup, routine release-note edits, low-risk
single-file changes, tests-only coverage improvements, and bug fixes that
only restore the existing baseline. This is a method-pack signal, not a
runtime gate, not an authoritative GateDecision, and not completion
authority.Durable architecture surfaces include canonical owner, public API/schema, artifact shape, behavior contract, dependency direction, source-of-truth owner, host compatibility strategy, install/discovery contract, method-pack/runtime-core boundary, runtime-ready artifact boundary, evidence model, retained fallback, adapter, compatibility path, duplicate owner, retirement schedule, accepted architecture-scoped Implementation Drift, and release/distribution strategy that future contributors would otherwise misread.
ADR Backfill Check:
- Trigger: yes | no
- Suggested action: create | amend | supersede | skip
- Evidence source:
- Baseline sync: needed | not-needed | unknown
- Skip reason:
- Boundary: advisory method-pack signal only
If the suggested action is create, amend, or supersede, or if Baseline sync
is needed or unknown, use recording-architecture-decisions for the ADR
lifecycle and Baseline Sync Closure before making the final completion
claim. This keeps verification-before-completion as the completion owner
while delegating the ADR/baseline writeback decision to the dedicated skill.
Repair Track: repaired object | action | impact | verification
Retirement Track: retired object | action | retained boundary | future trigger
Residual Risk: unverified | deferred
For work that adds, replaces, or retains old logic, also make the delete-first closure explicit:
Retirement Closure:
- Old logic located:
- Deleted:
- Retained:
- Retention reason:
- Retirement trigger:
- Lingering references checked: