Thorough analysis of code, architecture, or topics — produces a structured research report. Supports goal-based modes: Explore (general research), Audit (risk-focused critique), Improve (actionable suggestions), Custom (user-defined lens). Triggers: "analyze this code", "deep dive into", "research this topic", "investigate how X works", "audit this", "critique this", "suggest improvements", "find risks in".
Structured ideation and trade-off analysis for design and architecture decisions. Triggers: "brainstorm ideas", "explore this problem", "think through trade-offs", "challenge assumptions", "discuss architecture".
Executes routine maintenance and cleanup tasks from a structured task queue. Triggers: "do some chores", "housekeeping tasks", "small cleanup tasks", "work through a task queue".
Drives feature implementation following structured development protocols. Triggers: "implement this feature", "build this", "write the code", "TDD implementation", "execute the plan".
Iterative prompt and schema refinement using TDD methodology for LLM workloads. Triggers: "refine the prompt", "improve extraction", "iterate on schema", "prompt TDD", "tune the model".
Reviews and validates work across sessions for consistency and correctness. Triggers: "review session work", "validate debriefs", "approve session reports", "end-of-day review".
Designs and writes test cases for code correctness and regression prevention. Supports goal-based modes: Coverage (gap-filling), Hardening (edge cases & stress), Integration (boundaries & contracts), Custom (user-defined lens). Triggers: "write tests", "design test cases", "verify edge cases", "catch regressions", "test this feature", "harden this", "integration tests".
Keeps documentation in sync with code changes and project state. Triggers: "update documentation", "patch the docs", "sync docs with code changes", "update architecture docs".