// Use when designing deployment pipelines, CI/CD, terraform, or infrastructure automation. Enforces rollback checkpoint then TodoWrite with 19+ items. Triggers: "deploy", "CI/CD", "kubernetes", "terraform". If thinking "rollback later" - use this first.
MANDATORY before creating TodoWrite:
Why first: 27% skip rollback when checkpoint appears later.
CREATE TodoWrite with 4 sections (19+ items total):
| Section | Min Items | Order |
|---|---|---|
| Automation | 5+ | 1st |
| Observability | 5+ | 2nd (BEFORE Failure Recovery) |
| Failure Recovery | 5+ | 3rd (requires Observability) |
| Verification | 4+ | 4th |
Section order matters: You cannot define failure recovery without observability to detect failures.
After creating TodoWrite, verify 3 random items:
Each item must have ALL THREE:
| ❌ FAILS | ✅ PASSES |
|---|---|
| "Add monitoring" | "CloudWatch: deployment.duration_seconds, Grafana dashboard at /dashboards/deployments, PagerDuty alert if error rate > 5% for 3min" |
| "Implement rollback" | "Rollback .github/workflows/rollback.yml reverts to previous Docker tag from S3 deployment-history/latest-stable.txt. Triggers: manual OR error rate > 5% for 3min. Target: < 5 minutes. Test staging on [date]" |
/health returns 200 + dependency status)| Thought | Reality | Data |
|---|---|---|
| "Manual deploy is broken, need automation fast" | Automating without rollback creates WORSE problems | 27% skip rollback |
| "We'll add monitoring/rollback after" | Can't detect/recover from failures without them | 80% never add "later" |
| "Rollback is overkill" | Manual recovery ALWAYS takes 10x longer | 30+ min manual vs 2 min automated |
| "We can manually revert" | Detect (no monitoring) + find version (no automation) + apply (error-prone) | 30+ min |
❌ BLOCKED: Cannot deploy without rollback capability.
Required to override:
To skip ANY requirement, provide ALL 4:
[ ] 19+ items across 4 sections
[ ] 80%+ items have concrete numbers
[ ] 80%+ items name specific tools
[ ] 100% items have measurable outcomes
[ ] 3 random items pass specificity test
[ ] Rollback checkpoint completed
[ ] Observability BEFORE Failure Recovery (correct order)
Grade 7+/8: Ready to proceed
Grade <7: Revise TodoWrite
Before marking complete: