verification-before-completion
verification-before-completion enforces evidence-based completion claims by requiring execution of proof commands before declaring work done, passing, fixed, or ready to merge. Use this skill whenever about to claim success, express satisfaction, commit code, or hand off work to prevent false completeness assertions and ensure all completion claims include verification output, scope coverage, residual risk, and confidence grading.
git clone --depth 1 https://github.com/GanyuanRan/Aegis /tmp/verification-before-completion && cp -r /tmp/verification-before-completion/skills/verification-before-completion ~/.claude/skills/verification-before-completionSKILL.md
# Execute
→ About to claim "done", "passing", "fixed", "complete"? → **Run the verification command first. Then claim.**
1. Identify: what command proves the claim?
2. Run: full command, fresh, complete
3. Read: output, exit code, failures
4. Verify: output confirms claim? → state claim WITH evidence. Doesn't? → state actual status.
→ Done when: exact command run, output confirms, residual risk stated, confidence graded.
Non-trivial code changes → also report Complexity Delta, Complexity Governance Suggestion, and Major Complexity Alert when triggered.
Governance/retirement work → also close Repair Track + Retirement Track + Residual Risk.
# Verification Before Completion
## Overview
Claiming work is complete without verification is dishonesty, not efficiency. Evidence before claims, always.
## Red Flags - STOP
- Using "should", "probably", "seems to"
- Expressing satisfaction before verification ("Great!", "Perfect!", "Done!", etc.)
- About to commit/push/PR without verification
- Trusting agent success reports
- Relying on partial verification
- Thinking "just this once"
- Tired and wanting work over
- **ANY wording implying success without having run verification**
## When To Apply
Before ANY success/completion claim, expression of satisfaction, commit, PR, task completion, or delegation. Applies to exact phrases, paraphrases, and implications.
## QA Closure
Before any success claim, include the required evidence semantic slots. Natural
prose, localized headings, or compact cards are all valid when the slots remain
explicit and auditable.
```text
Required evidence slots, one allowed card rendering:
- Evidence action / check performed:
- Result / exit status:
- Covered scope:
- Uncovered scope:
- Residual risk:
- Confidence grade: A | B | C
```
Semantic Slots:
- Required governance fields may appear as localized headings, natural prose, or
compact cards when they remain explicit and auditable.
- Natural Surface is valid when natural user-facing wording preserves the
semantic slots; natural expression is not a reason to drop evidence,
uncovered scope, residual risk, confidence, retirement, baseline, or
architecture fields.
- In short: natural expression is valid only when it preserves semantic slots.
- Governance Receipt is the compact closeout form for Aegis-shaped non-trivial
work. It names the boundary held, evidence, covered and uncovered scope,
residual risk, confidence, and any triggered governance closure.
TDD Completion Boundary:
- Judge the completion claim against the highest available explicit boundary.
- Parent plan/spec acceptance decides whole-task completion; `TaskIntentDraft`
decides current-task completion; `Slice Card` decides slice completion only.
- Match the boundary to the claim being made, and keep any higher open boundary
explicit.
- If only slice-level evidence exists, do not claim whole-task `done`.
- If no explicit boundary exists and atomicity is not clear, downgrade to
`needs-verification` or return to framing/planning.
1. **Remove/Restore**: side effects? temp instrumentation restored?
2. **Evidence Bundle**: exact command, scope, exit status, key output. State what's covered and what's not. Include target test and related regression evidence. When automation is blocked, provide reproducible manual verification steps.
3. **Prompt Hygiene**: when external output shaped judgment → state whether summaries or raw excerpts were used. Name large payloads not loaded. If summary insufficient → read back excerpt or lower claim. Include Evidence Used / Not Loaded / Next Evidence boundary when relevant.
4. **Confidence**: A (direct + regression, no unknowns) | B (direct, bounded risk) | C (partial only, not closed)
5. **Authority**: verified evidence ≠ authoritative completion. Keep distinct.
6. **Goal Closure**: when `goal-framing` or optional `TaskIntentDraft` goal
fields shaped the work, explicitly check the goal before claiming completion:
Available boundary check for completion judgment:
1. Parent plan/spec acceptance for whole-task completion, when present
2. `TaskIntentDraft` Goal / Success evidence / Non-goals for the current
task, when present
3. `Slice Card` Goal / Verification / Stop for the current slice, when
present
```text
Goal Closure:
- Goal status: satisfied | blocked | needs-verification | scope-exceeded
- Success evidence:
- Stop state: done | blocked | needs-verification | scope-exceeded
- Non-goals respected:
```
Use `done` only when success evidence is satisfied; `blocked` for missing
dependency/permission/fact; `needs-verification` for insufficient evidence;
`scope-exceeded` when continuing would exceed goal or non-goals.
7. **Long-Task**: re-read checkpoint, confirm every todo has status, no drift check unresolved.
8. **Workspace Integrity**: if the task created or modified a target project's
`docs/aegis/` workspace and configured Aegis workspace support is available,
run
`python <aegis-workspace-helper> bundle --root <target-project-root> --work YYYY-MM-DD-<slug>`
when a `work/` record exists, then run
`python <aegis-workspace-helper> check --root <target-project-root>` and
include the result in the evidence bundle. The generated proof bundle and
workspace check validate method-pack structure, index coverage, and
recognizable JSON artifact sidecars only; they do not judge evidence
sufficiency and do not grant completion authority.
9. **Readiness Summary**: for release, merge, handoff, or "ready?" requests,
organize the evidence into a compact readiness view after the evidence
slots:
```text
Readiness Summary:
- Tests:
- Docs:
- Version:
- Host compatibility:
- Uncovered scope:
- Residual risk:
```
Advisory only. It does not authorize commit, tag, publish, merge, or release, and it does not provide completion authority.
10. **Natural Aegis closeout**: when Aegis skills materia|
Deprecated - use the aegis:brainstorming skill instead
Deprecated - use the aegis:executing-plans skill instead
Deprecated - use the aegis:writing-plans skill instead
Use when retiring old logic, collapsing duplicate owners, removing fallbacks, or touching schema, persistence, or source-of-truth boundaries while deciding whether to delete old paths, retain compatibility, or stop for confirmation.
Use when defining new features, product behavior, UI/component design, architecture choices, contract changes, or ambiguous medium/high-complexity work before implementation.
Use when the user asks for caveman mode, fewer tokens, brief responses, compressed communication, or otherwise explicitly requests a much shorter answer.
Use when facing 2+ independent tasks that can be worked on without shared state or sequential dependencies