with one click
autoresearch
// Autonomous iteration loop: modify, verify, keep/discard against any metric
// Autonomous iteration loop: modify, verify, keep/discard against any metric
| name | autoresearch |
| description | Autonomous iteration loop: modify, verify, keep/discard against any metric |
| version | 2.1.0 |
Iterations: unlimited.autoresearch/{subcommand}-{YYMMDD}-{HHMM}/ directory.handoff.json. Evals reads *-results.tsv.| Command | Does | Default Iterations |
|---|---|---|
$autoresearch | Iterate against a metric: modify → verify → keep/discard | 25 |
$autoresearch plan | Convert a goal into validated Scope, Metric, Verify config | N/A |
$autoresearch debug | Hunt bugs: hypothesize → test → falsify → repeat | 15 |
$autoresearch fix | Crush errors one-by-one until zero remain | 20 |
$autoresearch security | STRIDE + OWASP audit with red-team personas | 15 |
$autoresearch ship | Ship through 8 phases: checklist → dry-run → deploy → verify | N/A |
$autoresearch scenario | Generate edge cases across 12 dimensions | 20 |
$autoresearch predict | 5 expert personas debate before implementation | N/A |
$autoresearch learn | Scout codebase → generate docs → validate → fix loop | 10 |
$autoresearch reason | Adversarial debate with blind judges until convergence | 8 |
$autoresearch probe | 8 personas interrogate requirements until saturation | 15 |
$autoresearch improve | Research ICP challenges, discover improvements, generate PRDs | 15 |
$autoresearch evals | Analyze iteration results: trends, plateaus, regressions | N/A |
| Flag | Applies To | Purpose |
|---|---|---|
Iterations: N | All looping | Set iteration count |
Iterations: unlimited | All looping | Opt-in unbounded |
--evals | All looping | Mid-loop checkpoints + final summary |
--evals-interval N | All looping | Override checkpoint frequency |
--chain <targets> | All | Sequential handoff after completion |
--<subcommand> | All | Shorthand for --chain <subcommand> |
Grilling session that challenges your plan against the existing domain model, sharpens terminology, and updates documentation (CONTEXT.md, ADRs) inline as decisions crystallise. Use when user wants to stress-test a plan against their project's language and documented decisions.
Implements Manus-style file-based planning to organize and track progress on complex tasks. Creates task_plan.md, findings.md, and progress.md. Use when asked to plan out, break down, or organize a multi-step project, research task, or any work requiring 5+ tool calls. Supports automatic session recovery after /clear.
Find deepening opportunities in a codebase, informed by the domain language in CONTEXT.md and the decisions in docs/adr/. Use when the user wants to improve architecture, find refactoring opportunities, consolidate tightly-coupled modules, or make a codebase more testable and AI-navigable.
Capture a full DevTools-protocol trace of any browser automation — CDP firehose, screenshots, and DOM dumps — then bisect the stream into per-page searchable buckets. Use when the user wants to debug a failed run, audit network/console/DOM activity, attach a trace to an in-progress session, or feed structured per-page summaries back into an agent loop so its next iteration learns from the last one.
Kimi WebBridge lets AI control the user's real browser — navigate, click, type, read, screenshot, and interact with any website using the user's actual login sessions. Use this skill whenever the user wants to interact with websites, automate browser tasks, scrape web content, or perform any action requiring a real browser. Also use when the user mentions "browser", "webpage", "open URL", "screenshot", or asks to read/interact with any website. Use even for simple-sounding browser requests — the daemon handles all complexity.
Ultra-compressed communication mode. Cuts token usage ~75% by speaking like caveman while keeping full technical accuracy. Supports intensity levels: lite, full (default), ultra, wenyan-lite, wenyan-full, wenyan-ultra. Use when user says "caveman mode", "talk like caveman", "use caveman", "less tokens", "be brief", or invokes /caveman. Also auto-triggers when token efficiency is requested.