| name | cy-spec-peer-review |
| description | Runs an optional cross-LLM peer review of a TechSpec via compozy exec --ide claude --model opus --reasoning-effort xhigh and requires the reviewer to write a scoped Markdown findings artifact for user-directed incorporation. Use when a TechSpec draft has already been approved by the user and they want an external review round, especially for autonomy/network/memory-impacting designs. Do not use for PRDs, automatic approval gates, code review batches, or auto-looped review cycles. |
| trigger | explicit |
| argument-hint | [techspec-path] |
Spec Peer Review
Codex authors AGH TechSpecs with gpt-5.4 at reasoning_effort=xhigh; Claude Opus pressure-tests them. This skill runs that pressure-test only when the user explicitly asks for a review round after approving the current draft. It does not auto-run, auto-incorporate findings, or auto-loop additional rounds.
The review result is a direct-written Markdown findings file. compozy exec stdout/stderr is operational evidence only; never parse it as the review source of truth.
User Decisions
When this skill instructs the agent to ask whether to incorporate findings or run another round, it MUST use the runtime's dedicated interactive question tool — the tool or function that presents a question to the user and pauses execution until the user responds.
If the runtime does not provide such a tool, present the question as the complete assistant message and stop generating. Do not answer the question on the user's behalf.
Bundled Path Rule
Resolve bundled helper paths relative to the directory that contains this SKILL.md. When invoking the validator from a repository root, use the full repo-relative path:
bash .agents/skills/cy-spec-peer-review/scripts/validate-findings.sh --kind techspec --round <N> --path <findings-path>
The validator is a read-only helper: it inspects the findings artifact and exits non-zero on structural contract violations.
Required Inputs
- techspec-path (optional): explicit path to the
_techspec.md under review. When omitted, resolve to the most recently modified .compozy/tasks/<slug>/_techspec.md whose sibling _meta.md shows Pending: > 0 or no _meta.md exists yet.
Findings Artifact Contract
Each review round has exactly one authoritative findings file:
.compozy/tasks/<slug>/qa/peer-review-findings-roundN.md
The reviewer may write exactly that file and no other file. If the target path is missing, ambiguous, unwritable, or outside the named .compozy/tasks/<slug>/qa/ directory, the reviewer must refuse and stop. It must not print findings to stdout as a fallback.
The findings file MUST use this structure:
---
schema_version: 1
review_kind: techspec
round: N
readiness: READY|BLOCKED|NEEDS_REWORK
reviewer_runtime: claude
reviewer_model: opus
generated_at: <ISO-8601 timestamp>
---
# Summary
# Blockers
# Nits
# Evidence
# Deferred Or Follow-Up
Every blocker and nit must include an ID, a real section/path reference, the issue, and a concrete suggested fix. Blockers must also include the rationale for why the issue blocks approval.
Procedures
Step 1: Validate Input and Context
- Resolve
techspec-path. If omitted, list candidate paths and pick the freshest.
- Confirm the user has already approved the current draft or explicitly asked to review the saved spec as-is.
- Read the spec and confirm it is a final-shape TechSpec (has
Architectural Boundaries, Implementation Steps, Test Strategy sections) — not a draft.
- Read
references/quality-markers.md and verify the spec carries the six markers (boundary statement, listed boundaries, Go interface signatures, data-model field rationale, side-table-vs-JSON decisions, lease/safety invariants enumerated). If any marker is missing, abort and report the missing markers — Opus review is wasted on incomplete specs.
- Resolve the slug from the path; ensure
.compozy/tasks/<slug>/ exists and is writable.
- Ensure
.compozy/tasks/<slug>/qa/ exists before dispatch.
- Determine the next review round number by listing existing
qa/peer-review-findings-round*.md, qa/peer-review-summary-round*.md, and legacy qa/peer-review-result-round*.json* files (prior local output only — not a compatibility path). Start at round1 when none exist.
Step 2: Compose the Review Prompt
- Read
references/peer-review-prompt.md for the canonical executable Opus prompt template. The assembled prompt must start with the reviewer instructions, not with a Markdown wrapper describing the template.
- Define the round artifact paths:
- Findings target:
.compozy/tasks/<slug>/qa/peer-review-findings-roundN.md.
- Operational event log:
.compozy/tasks/<slug>/qa/peer-review-events-roundN.jsonl.
- Operational stderr log:
.compozy/tasks/<slug>/qa/peer-review-result-roundN.err.
- Pre-run status snapshot:
.compozy/tasks/<slug>/qa/peer-review-status-before-roundN.txt.
- Post-run status snapshot:
.compozy/tasks/<slug>/qa/peer-review-status-after-roundN.txt.
- Validation error, only when needed:
.compozy/tasks/<slug>/qa/peer-review-validation-error-roundN.md.
- Substitute the placeholders:
{techspec_path} — exact path to the TechSpec under review.
{adr_paths} — any adrs/*.md siblings, or none.
{related_research} — any analysis/*.md siblings, or none.
{findings_path} — exact absolute path to .compozy/tasks/<slug>/qa/peer-review-findings-roundN.md.
{round} — numeric review round N.
- Write the assembled prompt to
.compozy/tasks/<slug>/qa/peer-review-prompt-roundN.md.
Step 3: Execute the Cross-LLM Review
-
Capture the pre-run status snapshot:
git status --short > .compozy/tasks/<slug>/qa/peer-review-status-before-roundN.txt
-
Run:
compozy exec --ide claude --model opus --reasoning-effort xhigh --format json --prompt-file .compozy/tasks/<slug>/qa/peer-review-prompt-roundN.md > .compozy/tasks/<slug>/qa/peer-review-events-roundN.jsonl 2> .compozy/tasks/<slug>/qa/peer-review-result-roundN.err
-
Capture the post-run status snapshot:
git status --short > .compozy/tasks/<slug>/qa/peer-review-status-after-roundN.txt
-
If the command returns a non-zero exit code, fail loudly. Do not retry silently. Inspect stderr for model misconfiguration (see Error Handling).
-
Treat peer-review-events-roundN.jsonl as operational evidence only. Do not parse it for readiness or findings.
-
Require the findings target file to exist after the command exits. If missing, the round is invalid even when compozy exec exited 0.
Step 4: Validate and Summarize Findings
-
Run the bundled read-only validator:
bash .agents/skills/cy-spec-peer-review/scripts/validate-findings.sh --kind techspec --round N --path .compozy/tasks/<slug>/qa/peer-review-findings-roundN.md
-
Manually inspect the findings file and verify the semantic contract:
- every finding has a real section/path reference;
- blockers include a rationale tied to project rules, lessons, or architecture constraints;
- no
TBD, placeholder text, invented paths, or stdout-only findings;
- comparing the pre/post status snapshots shows no changes outside the expected review artifact/log paths.
-
If validation fails, write .compozy/tasks/<slug>/qa/peer-review-validation-error-roundN.md with the failed checks, command, exit status, and artifact paths. Do not summarize the round as READY.
-
Write .compozy/tasks/<slug>/qa/peer-review-summary-roundN.md from the validated findings file with:
- readiness verdict (
READY / BLOCKED / NEEDS_REWORK);
- one-line rationale per blocker;
- nits list;
- recommended sections and ADRs likely affected;
- operational artifact paths.
-
Present a concise user-facing summary of the review. Include the verdict, blocker/nit counts, the main themes, and the artifact paths written for the round.
-
Do NOT modify the TechSpec or ADRs yet.
Step 5: User-Directed Incorporation
- Ask the user which findings to incorporate:
- A) all blockers
- B) selected blockers/nits
- C) nothing
- D) manual edits before any incorporation
- Apply only the findings the user selected. Do not silently apply all blockers or all nits.
- If incorporation requires an ADR update, update only the ADRs tied to the selected findings.
- Record the incorporation decision in
.compozy/tasks/<slug>/qa/peer-review-incorporation-roundN.md, listing:
- incorporated items;
- deferred items;
- files changed.
- Show the user what changed and what remains deferred.
Step 6: Optional Additional Rounds
- Ask whether the user wants another peer-review round or wants to stop with the current saved spec.
- If the user requests another round, re-run from Step 2 against the updated TechSpec and create a fresh
roundN+1 artifact set.
- Do not auto-loop. The user explicitly requests further rounds.
Critical Rules
- This skill never commits, pushes, opens PRs, auto-approves specs, or invokes provider review fetchers.
- Prompt, event log, findings, summary, incorporation, and status snapshot artifacts are versioned with
-roundN. Never overwrite a prior round.
- The
compozy exec call is the only place this skill spends external review credit. Do not invoke it more than once per round unless the round is explicitly invalid and the user requests a rerun.
- The bundled helper paths used by this skill (
references/peer-review-prompt.md, references/quality-markers.md, scripts/validate-findings.sh) are read-only templates/helpers — the skill reads or runs them, never edits them during a review round.
Error Handling
- Model misconfiguration (
The model 'X' does not exist): stop and surface the configured model. The IDE may be set to a stale name like gpt-5.5. Do not mutate the call to substitute a model — verify with the user. (See docs/_memory/lessons/L-010-model-name-validation.md.)
compozy exec not found: the skill assumes Compozy CLI is on PATH. If absent, fail with the install hint rather than swallowing.
- Quality markers missing: if the Step 1 quality-marker check fails, do not run Opus. Print the missing markers and exit so the user can amend the spec first.
- Missing findings file: treat this as an invalid round, not a clean review. Write a validation-error artifact and ask whether to rerun.
- Malformed findings frontmatter or missing required sections: treat this as an invalid round. Do not infer readiness from stdout.
- Empty or placeholder findings: treat empty
# Blockers or # Nits sections as acceptable only when the section explicitly says None.; reject TBD, TODO, or vague placeholders.
- Existing peer-review files: never overwrite. Prompt, event log, findings, summary, and incorporation files are all versioned with
-roundN.