| name | sidecar-task-runner |
| description | Run artifact-driven sidecar agent tasks through one-shot Codex CLI sessions. Use when a main agent should delegate bounded scans, drafts, audits, pre-reviews, or mechanical repo tasks to a fast isolated sidecar model such as gpt-5.3-codex-spark while keeping final decisions with the main agent. |
| allowed-tools | Read, Write, Edit, Bash, Glob |
Sidecar Task Runner
Use this skill to run bounded helper tasks from a main agent without sharing the main conversation as the task state. The sidecar produces artifacts; the main agent verifies, integrates, rejects, or escalates them.
Skill Directory Layout
<installed-skill-dir>/
āāā SKILL.md
āāā templates/
ā āāā precommit-classifier.md
ā āāā personalization-scanner.md
āāā scripts/
āāā prepare_sidecar_task.py
Core Contract
- The main agent owns planning, integration, final judgment, and user-facing actions.
- The sidecar gets a small prompt artifact, not the main chat transcript.
- Default to
gpt-5.3-codex-spark for fast local scans, first drafts, pre-reviews, consistency checks, and other low/medium-risk tasks.
- Prefer deterministic scripts and direct shell commands for facts; use the sidecar for synthesis, ranking, summarization, mismatch detection, and candidate text.
- Default sandbox is
read-only. Use workspace-write only when the sidecar must write its own artifacts or make explicitly bounded edits.
- Keep human gates before external or irreversible actions such as
git tag, git push, release upload, job submission, or public issue creation.
- Do not use a sidecar as the only reviewer for high-risk algorithm design, core correctness, security/privacy, final merge approval, or final submission decisions.
Artifact Layout
Use a repo-local directory:
.agent/sidecars/<task-id>/
āāā prompt.md
āāā input-manifest.md
āāā output.md
āāā findings.md
āāā decision.md
āāā model.md
āāā model.json
Meanings:
prompt.md: exact prompt given to the sidecar.
input-manifest.md: files, commands, diffs, or project state the sidecar may inspect.
output.md: raw sidecar final response, usually written with codex exec -o.
findings.md: distilled findings if the sidecar writes structured results.
decision.md: main-agent decision after reading the sidecar output.
model.md: human-readable command, model, sandbox, and token notes.
model.json: structured run metadata for token and lifecycle audits.
Prepare A Task
Create the artifact directory with:
python3 <installed-skill-dir>/scripts/prepare_sidecar_task.py \
--repo . \
--task-id <task-id> \
--title "<short task title>" \
--phase tooling \
--task-type audit \
--prompt-file <prompt-file>
If no prompt file exists, pass --prompt "<task instructions>".
For a fast precommit classification sidecar, use the bundled preset:
python3 <installed-skill-dir>/scripts/prepare_sidecar_task.py \
--repo . \
--title "Precommit classifier" \
--phase maintenance \
--task-type audit \
--preset precommit-classifier \
--input "git status --short" \
--input "git diff"
For a personalization scan that extracts reusable preferences from sanitized artifacts without asking the user:
python3 <installed-skill-dir>/scripts/prepare_sidecar_task.py \
--repo . \
--title "Personalization scan" \
--phase maintenance \
--task-type audit \
--preset personalization-scanner \
--input "memory/current-status.md" \
--input ".agent/sidecars/*/decision.md"
Run Codex Spark
For a read-only sidecar:
codex exec --ephemeral \
-m gpt-5.3-codex-spark \
-C . \
-s read-only \
-o .agent/sidecars/<task-id>/output.md \
"$(cat .agent/sidecars/<task-id>/prompt.md)"
For a sidecar that must write its own findings.md or make tightly scoped artifact edits:
codex exec --ephemeral \
-m gpt-5.3-codex-spark \
-C . \
-s workspace-write \
-o .agent/sidecars/<task-id>/output.md \
"$(cat .agent/sidecars/<task-id>/prompt.md)"
Avoid codex resume, codex fork, claude --continue, or claude --resume for first-pass sidecar work. Those are continuation mechanisms, not clean sidecar boundaries.
Prompt Requirements
Every sidecar prompt should state:
- task goal and non-goals
- files or directories it may inspect
- whether code edits are allowed
- required output path and format
- escalation criteria for strong reviewer or main-agent judgment
- privacy boundary: do not copy raw private logs or unrelated conversation text into repo artifacts
Keep prompts narrow. If the task needs several independent analyses, create separate sidecar runs rather than one broad prompt.
Model Routing
Good Spark sidecar tasks:
- first-pass code review before a strong review
- precommit path classification: Fast Path, Skill Path, Code Path, or Risk Path
- personalization scans over trajectories, sidecar artifacts, diffs, and memory to propose reusable preferences
- git milestone proposal from recent commits and docs
- README / AGENTS / CLAUDE consistency scan
- test-gap scan after a focused implementation
- experiment log triage and config mismatch inventory
- evidence inventory across CSVs, reports, tables, and figures
- reviewer-comment clustering and action-item extraction
- citation placeholder or TODO scans
Use the main agent or a stronger fresh reviewer for:
- core algorithm design decisions
- final code approval for high-risk changes
- security, privacy, or data-governance decisions
- large refactors with unclear blast radius
- final paper positioning, rebuttal strategy, or submission judgment
Precommit Classifier Contract
Use the precommit-classifier preset when a commit/push closeout would otherwise be slowed down by deciding which gates to run. The sidecar inspects only read-only Git state and affected public repo files, then recommends:
- the commit path: Fast Path, Skill Path, Code Path, or Risk Path
- the minimal validation commands
- whether reinstall is needed
- whether reinstall can be limited to specific changed skills
The main agent must still stage files, commit, push, reinstall, and report the outcome. A sidecar must not perform external or irreversible actions.
Personalization Scanner Contract
Use the personalization-scanner preset when a main agent wants low-cost automatic memory writeback candidates from trajectories or repo artifacts. The sidecar may inspect only explicitly listed inputs and returns candidate preferences with scope, confidence, evidence, suggested target, and privacy notes.
The main agent must still decide what to write, keep private facts out of public repo memory, and perform any project-memory or skill-repo edits. The sidecar must not ask the user, quote raw logs, or promote public skill rules by itself.
Token Telemetry
Because --ephemeral may avoid normal persisted session logs, record run metadata in model.json. If Codex CLI reports token usage in the terminal, copy the numeric fields into model.json or model.md before running token-usage-auditor.
If exact usage is unavailable, leave usage values as null and record:
Token usage: unavailable from ephemeral run output.
Never store raw prompts from private chat logs just to recover token counts.