بنقرة واحدة
run
// Use the local Autodialectics MCP server and CLI to compile tasks, execute anti-slop runs, inspect artifacts, replay runs, benchmark policies, and evolve champions in this repository.
// Use the local Autodialectics MCP server and CLI to compile tasks, execute anti-slop runs, inspect artifacts, replay runs, benchmark policies, and evolve champions in this repository.
Run benchmark suites and manage policy evolution — create challengers, compare against champions, promote or rollback policies.
Examine Autodialectics run results, manifests, and stored artifacts. Use after a pipeline run completes or to review past runs.
Re-run a stored Autodialectics pipeline run, optionally with a different policy, to compare outcomes or debug regressions.
Compile and execute an Autodialectics anti-slop pipeline for a task. Covers health checks, runtime init, contract compilation, and full pipeline execution.
| name | run |
| description | Use the local Autodialectics MCP server and CLI to compile tasks, execute anti-slop runs, inspect artifacts, replay runs, benchmark policies, and evolve champions in this repository. |
Use this skill when the user wants to work through the Autodialectics harness instead of asking the model to freestyle a task.
Repository facts:
autodialectics.yamluv run autodialectics-mcpuv run autodialectics.venv/bin/python -m autodialectics.cli.mainPreferred MCP workflow:
healthinit_runtimecompile_taskrun_taskinspect_run or read_artifactbenchmark, evolve_policy, promote_policy, rollback_policy, or replay_runCLI fallback command forms:
uv run autodialectics init
uv run autodialectics compile examples/code_fix/task.json
uv run autodialectics run examples/code_fix/task.json
uv run autodialectics benchmark
uv run autodialectics inspect <run_id>
uv run autodialectics replay <run_id>
uv run autodialectics evolve
uv run autodialectics promote <policy_id>
uv run autodialectics rollback
Working rules:
uv run autodialectics ... over ad hoc module invocation.--config autodialectics.yaml if config resolution looks ambiguous.artifacts/run_* directory and summarize the gate decision, score, slop composite, and unresolved risks.Expected examples:
inspect after run or benchmark instead of guessing from stdout.replay when the user wants the same task rerun under another policy.