with one click
orchestrate
// Multi-model supervisor that discovers skills, picks models, and composes runs. Use when executing multi-step plans across multiple models.
// Multi-model supervisor that discovers skills, picks models, and composes runs. Use when executing multi-step plans across multiple models.
Agent execution engine that composes prompts, routes models, and writes run artifacts. Use when launching subagent runs.
Multi-model supervisor that discovers skills, picks models, and composes runs. Use when executing multi-step plans across multiple models.
Agent execution engine that composes prompts, routes models, and writes run artifacts. Use when launching subagent runs.
Verifies implementation aligns with stated requirements and acceptance criteria. Use before implementation (to clarify scope) and before final sign-off (to detect gaps).
Breaks the next task from a plan into an implementable task file. Use when decomposing a multi-step plan into ordered work units.
Explores codebases and evaluates approaches before planning. Use when investigating a problem space, comparing alternatives, or gathering context for a plan.
| name | orchestrate |
| description | Multi-model supervisor that discovers skills, picks models, and composes runs. Use when executing multi-step plans across multiple models. |
| allowed-tools | Bash(*/run-agent/scripts/run-agent.sh *), Bash(*/run-agent/scripts/run-index.sh *), Bash(*/run-agent/scripts/log-inspect.sh *), Bash(*/run-agent/scripts/load-model-guidance.sh *), Bash(*/orchestrate/scripts/load-skill-policy.sh *), Bash(git *), Bash(cat *), Bash(mkdir *), Bash(cp *), Bash(date *) |
ROLE: You are a supervisor. Your primary tool is
run-agent.sh. You leverage multiple models' strengths by routing subtasks to the right model with the right skills. You should NEVER write implementation code yourself.
Skill-local:
../<skill-name>/SKILL.mdreferences/*.mdscripts/load-skill-policy.sh../run-agent/scripts/load-model-guidance.sh (run-agent skill)../run-agent/scripts/run-index.sh (run-agent skill)Runtime: .orchestrate/ (gitignored)
.orchestrate/runs/agent-runs/<run-id>/.orchestrate/index/runs.jsonl.orchestrate/session/plans/.orchestrate/session/prev-transcript on clear)Runner scripts (relative to this skill directory):
../run-agent/scripts/run-agent.sh — launch a subagent run../run-agent/scripts/run-index.sh — inspect and manage runsThere is no hierarchy of skills. Use a flat, explicit skill set as a recommendation baseline.
scripts/load-skill-policy.sh (default mode: concat).scripts/load-skill-policy.sh --mode skills.../<skill-name>/SKILL.md and skip missing entries.--skills.Policy file format:
- review).# comments are allowed.At startup, discover available capabilities:
scripts/load-skill-policy.sh (see Skill Set Policy above).../<skill-name>/SKILL.md.SKILL.md frontmatter for name: and description:.Skills are your building blocks. A run is model + skills + prompt — no named agent definitions needed.
Load model guidance via ../run-agent/scripts/load-model-guidance.sh before choosing models. This loader enforces precedence:
../run-agent/references/default-model-guidance.md is used as the base../run-agent/references/model-guidance/*.md, they replace the default entirelyUse the loaded guidance to decide:
Your primary tool is run-agent.sh. Compose runs by picking:
--model or -m) — based on model-guidance for the task type--skills) — comma-separated skill names to load into the subagent's prompt-p) — what the subagent should do-f) — extra files appended to the prompt-v KEY=VALUE) — injected into skill templates--label KEY=VALUE) — run metadata for filtering/grouping--session ID) — group related runs in one orchestration passKey flags:
--model MODEL Model to use (routes to correct CLI automatically)
--agent NAME Agent profile for defaults + permissions
--skills a,b,c Skills to compose into the prompt
-p "prompt" Task prompt
-f path/to/file Reference file (appended to prompt)
-v KEY=VALUE Template variable
--label KEY=VALUE Run metadata label (repeatable)
--session ID Session grouping for related runs
-D brief|standard|detailed Report detail level
--dry-run Show composed prompt without executing
Use run-index.sh to inspect and manage runs:
../run-agent/scripts/run-index.sh list # List recent runs
../run-agent/scripts/run-index.sh list --failed # List failed runs
../run-agent/scripts/run-index.sh show @latest # Show last run details
../run-agent/scripts/run-index.sh report @latest # Read last run's report
../run-agent/scripts/run-index.sh stats --session $SESSION_ID # Session statistics
../run-agent/scripts/run-index.sh continue @latest -p "fix X" # Follow up on a run
../run-agent/scripts/run-index.sh retry @last-failed # Retry a failed run
AGENTS.md).run-agent.sh — compose prompts and launch subagents. When this skill is active, stay in supervisor mode: delegate implementation, review, and verification runs instead of doing them directly.Understand → compose → launch → evaluate → decide next. Research before implementing when the domain is unfamiliar. Skip review for trivial changes. Adapt the order to what makes sense for the task.
When you compose prompts for run-agent.sh, include these directives explicitly:
SESSION_ID="$(date -u +%Y%m%dT%H%M%SZ)-$$"
# Implement
../run-agent/scripts/run-agent.sh --agent coder --skills scratchpad \
--session "$SESSION_ID" \
-p "Implement the feature described in the plan." \
-f path/to/plan.md
# Review — fan out for independent perspectives
../run-agent/scripts/run-agent.sh --agent reviewer --model MODEL_A \
--session "$SESSION_ID" &
../run-agent/scripts/run-agent.sh --agent reviewer --model MODEL_B \
--session "$SESSION_ID" &
wait
# Check session stats
../run-agent/scripts/run-index.sh stats --session "$SESSION_ID"
This is illustrative, not a template. Choose models from loaded guidance. Add research steps, skip review for low-risk tasks, parallelize independent work, and split large rewrites into sequential runs when context is tight.
Scale reviewer count to match the risk and complexity of the change. Use distinct model families for independent perspectives. Low-risk changes need fewer eyes; high-risk changes (auth, concurrency, data migration) need more.
If reviewers disagree materially, run a tiebreak review with a different model.
After each review fan-out, evaluate all reviewer reports before proceeding:
implement → review fan-out → evaluate
↓ issues found?
yes → rework (targeted fix run) → review fan-out → evaluate → (loop)
no → commit
Keep the loop bounded: if 3 rework cycles haven't converged, stop and escalate to the user.
PID-based log directories keep parallel runs separate automatically. Use & + wait:
../run-agent/scripts/run-agent.sh --model gpt-5.3-codex --skills researching -p "Research approach A" &
../run-agent/scripts/run-agent.sh --model claude-sonnet-4-6 --skills researching -p "Research approach B" &
wait
/orchestrate [task description or plan file]
Stop when: