con un clic
tdd-evidence-workflow
TDD-driven evidence workflow for generating authoritative failure/fix proof in PRs.
Menú
TDD-driven evidence workflow for generating authoritative failure/fix proof in PRs.
Multi-PR triage and parallel work dispatcher. Prevents single-PR tunnel vision by enforcing a survey-before-deep-dive protocol.
Classifies GitHub issues/PRs into PR-type categories and recommends autor techniques. Used by packages/core/src/decomposer.ts.
Use this skill when working in repositories managed by Agent Orchestrator or when the user asks how to use `ao` properly. Covers the default AO workflow: bootstrap with `ao start`, dispatch work with `ao spawn`, inspect progress with `ao status` or `ao session ls`, steer sessions with `ao send`, and recover or clean up sessions safely.
Canonical 7-green PR merge criteria, PR status check pattern, PR freeze discipline, and admin merge protocol
Define and run skeptic exit criteria for non-trivial tasks — independent verification agent with inverted incentive to find gaps
How to record asciinema/tmux evidence videos that prove work was done correctly
| name | tdd-evidence-workflow |
| description | TDD-driven evidence workflow for generating authoritative failure/fix proof in PRs. |
This skill operationalizes Test-Driven Development (TDD) as the primary mechanism for generating authentic, authoritative evidence for pull requests.
Authentication of a fix requires proving that the bug or gap existed immediately before the fix was applied. This is achieved by capturing the "Red" phase of TDD.
All AO workers and agents MUST follow these steps for every bug fix or feature implementation:
AssertionError).initial_failure or red_phase in your evidence bundle.PASS.final_verification or green_phase in your evidence bundle.Your ## Evidence section in the PR MUST include both phases:
## Evidence
### TDD Cycle Proof
- **Initial Failure (Red)**: [repro_failure.log](URL) or [repro_video.mp4](URL)
- **Final Verification (Green)**: [verify_success.log](URL) or [verify_video.mp4](URL)
### Summary
| Phase | Result | Artifact |
|-------|--------|----------|
| Red | FAIL | logs/initial.log |
| Green | PASS | logs/final.log |
Screenshots are point-in-time snapshots that can be easily faked or misinterpreted. Video captures the temporal progression of the fix, especially when showing the "failed state" transitioning to the "fixed state" in the same or sequential recordings.