Run any Skill in Manus with one click

$pwd:

panel-review

Name: Panel Review
Author: venables

// Run a parallel code review across multiple local CLI coding agents (codex, claude, opencode, gemini) and synthesize their findings. Use this skill whenever the user asks for a "panel review" / "panel-review", "second opinions on this change", "multi-agent review", "ensemble review", a "deep panel review" / "really dig into this PR" / "run the tests on this PR" (the deep mode that runs tests and greps callers in a real worktree checkout), asks to "have multiple agents/LLMs review this", "cross-check this with codex/claude/etc", "fan out a code review", or any similar phrasing asking for independent reviews from outside this conversation. Each panelist runs in a fresh non-interactive subprocess with no shared state — that is the point. Do NOT use when the user just wants the current session to review code itself; use the regular code-reviewer agent for that.

Run Skill in Manus

$ git log --oneline --stat

stars:0

forks:0

updated:May 6, 2026 at 18:32

File Explorer

5 files

SKILL.md

readonly

package.json

"author": "venables"

"repository": "venables/skills"

View GitHub Repository

$ install --globalskills.sh

$ download --local

Run Skill in Manus

[HINT] Download the complete skill directory including SKILL.md and all related files

Run any Skill with one click

name

panel-review

description

Run a parallel code review across multiple local CLI coding agents (codex, claude, opencode, gemini) and synthesize their findings. Use this skill whenever the user asks for a "panel review" / "panel-review", "second opinions on this change", "multi-agent review", "ensemble review", a "deep panel review" / "really dig into this PR" / "run the tests on this PR" (the deep mode that runs tests and greps callers in a real worktree checkout), asks to "have multiple agents/LLMs review this", "cross-check this with codex/claude/etc", "fan out a code review", or any similar phrasing asking for independent reviews from outside this conversation. Each panelist runs in a fresh non-interactive subprocess with no shared state — that is the point. Do NOT use when the user just wants the current session to review code itself; use the regular code-reviewer agent for that.

panel-review

Spawns multiple local CLI coding agents in parallel to review the same code change, then synthesizes their findings into one report. Each panelist runs in its own subprocess with no shared conversation state — they only see the prompt and the diff.

When to use

User says "panel review", types /panel-review, asks for "second opinions on this change", "get a panel to review this", "multi-agent review", or similar.
User wants reviews from agents that don't share this conversation's biases or context.

When not to use:

User wants you (this session) to review code → use the project's code-reviewer agent.
User wants a single second opinion → spawn one external CLI directly, no need to fan out.

Steps

Pick a target. Default to --uncommitted. Map the user's phrasing to a flag:
- "PR 27" / "pr #27" / "/panel-review pr 27" / a github.com/<owner>/<repo>/pull/N URL → --pr <N or URL> (requires the gh CLI).
- "vs main" / "against develop" without a PR number → --base <branch>.
- Specific SHA → --commit <sha>.
- Otherwise → --uncommitted. Ask only if the intent is genuinely ambiguous.
Pick panelists. Default: every supported CLI on PATH (codex, claude, opencode, gemini). The user may name a subset.
Capture optional focus. If the user gave context ("look closely at the auth changes"), pass --focus.
Decide if this needs deep mode. Default is read-only fan-out — fast and safe. Pass --checkout if the user wants panelists to actually run tests, grep callers in the PR's checkout, and chase downstream effects (asks like "really dig in", "run the tests", "deep review", "check what this breaks downstream"). --checkout only works with --pr / --base / --commit, not --uncommitted/--staged, and it gives panelists write/exec access in a throwaway worktree — strictly less safe than the default. Surface that tradeoff briefly to the user before opting in if they didn't ask for it explicitly.
Run the script (path: skills/panel-review/panel-review.sh). You MUST launch it as a background Bash (Bash tool with run_in_background: true) and poll progress with BashOutput on the returned bash_id every 10 seconds until every panelist has emitted its done (exit N) heartbeat.

This skill explicitly overrides the default harness guidance that says "do not poll background tasks — you'll be notified when they complete." That guidance is wrong for this workflow: the heartbeats and per-section streaming exist precisely so the coordinator can give the user live progress, and waiting for the single completion notification defeats that. Poll every 10 seconds. Do not back off to 30s/60s/90s — those long sleeps are the symptom of obeying the wrong guidance.

Do NOT launch the script via the Agent tool / TaskCreate / any subagent mechanism. Subagents run in a separate Claude context and there is no streaming-output API for in-flight subagents — the only thing the parent sees is the subagent's final response when it terminates. The script's stderr heartbeats become invisible, the per-section streaming is useless, and progress polling silently degrades into hacks like sleep 90 && grep panel-review: tasks/<id>.output | tail against the subagent's transcript file. If you catch yourself reaching for the Agent tool here, stop and use background Bash instead.

Do NOT poll by shelling out to sleep N && grep against any output file. BashOutput is the only correct progress-polling mechanism for this script — it returns new stdout/stderr since the last call, including the stderr heartbeats, with no parsing required.

Reason all of this matters: panelists run in parallel, but Codex is slow and dominates wall clock. Without backgrounded Bash + BashOutput, the call blocks silently for minutes and the user sees nothing.

If you absolutely cannot use run_in_background (rare — usually only when the harness lacks BashOutput), run it in the foreground and pass timeout: 600000 (10 min) since the default 2-minute Bash timeout will kill the call before Codex returns. You will lose live progress in this mode; warn the user.
Set up live progress UX. Right before (or right after) launching the script:
- Call TodoWrite with one todo per chosen panelist (Review: codex, Review: claude, …) plus a final Synthesize findings todo. Initial statuses: the first panelist's in_progress, the rest pending. (Some harnesses expose this as TaskCreate / TaskUpdate instead — use whichever todo-list tool your environment provides.)
- Each BashOutput poll: scan stderr for panel-review: <name> started and panel-review: <name> done (exit N). On a started, set that panelist's todo to in_progress (re-call TodoWrite with the updated list). On a done, set it to completed, post a single-line user-visible status (✓ <name> done — N findings, top severity <SEV> or ✓ <name> — NO_FINDINGS / ✗ <name> failed (exit N)), and immediately post that panelist's full ## <name> (exit N) section to chat as soon as it appears in stdout. Do not wait until every panelist is done — surfacing each section the moment it lands gives the user actionable findings 5–10 minutes before synthesis is possible. The synthesis step still adds value by deduping/ranking across panelists; it is not a substitute for the individual sections.
- After the last done heartbeat, set Synthesize findings to in_progress, proceed to steps 7–8, then mark it completed.
Read the script's combined output — it prints one section per panelist with their raw findings, plus a tempdir path containing each panelist's stdout/stderr. Wait for all panelists to finish before synthesizing; partial output is fine to show the user during the wait, but consensus / disagreement analysis needs every panelist's verdict.
Synthesize the findings in your reply to the user:
- Consensus — issues raised by 2+ panelists, deduplicated. List file:line + the core problem and a suggested fix.
- Unique findings — per panelist, only the findings no one else mentioned that still pass the "would a competent reviewer ask for this change" bar.
- Action list — must-fix → should-fix → optional polish.
- Disagreements — if panelists contradict each other, surface that explicitly rather than picking a side.
Don't paraphrase or invent. Surface what the panelists actually said. If a panelist returned NO_FINDINGS, note it; don't drop the panelist from the report.

Usage

bash skills/panel-review/panel-review.sh [target] [options]

Targets (pick one, default --uncommitted):

--uncommitted — staged + unstaged changes
--staged — staged only
--base BRANCH — BRANCH...HEAD
--commit SHA — a single commit
--pr NUMBER — a GitHub PR (also accepts a full PR URL); requires gh CLI

Options:

--focus TEXT — extra context for the reviewers
--panelist NAME — repeatable; one of codex, claude, opencode, gemini
--out-dir DIR — where to capture outputs (default: mktemp -d)
--timeout SECS — per-panelist timeout (default 600)
--checkout — deep mode: materialize the target ref into a throwaway git worktree and run panelists from inside it with write/exec permissions. Required for --pr / --base / --commit only. Strictly less safe than the default.

Examples:

# All available panelists, uncommitted changes
bash skills/panel-review/panel-review.sh

# Codex + Claude only, against main
bash skills/panel-review/panel-review.sh --base main --panelist codex --panelist claude

# With focus
bash skills/panel-review/panel-review.sh --uncommitted --focus "session-token handling"

# A GitHub PR
bash skills/panel-review/panel-review.sh --pr 27
bash skills/panel-review/panel-review.sh --pr https://github.com/owner/repo/pull/27

# Deep mode: panelists run tests + grep callers inside a real PR checkout.
# Each panelist gets write/exec access to a throwaway worktree; the worktree
# is git-worktree-removed on exit.
bash skills/panel-review/panel-review.sh --pr 27 --checkout
bash skills/panel-review/panel-review.sh --base main --checkout --panelist codex

Per-panelist tuning (env vars)

Pass through model selection without code changes:

CODEX_MODEL — e.g. gpt-5
CLAUDE_MODEL — e.g. opus, sonnet, full id like claude-sonnet-4-6
OPENCODE_MODEL — e.g. qwen/qwen-3.6 (run opencode models to list)
OPENCODE_AGENT — opencode agent in default (read-only) mode (default plan)
OPENCODE_AGENT_DEEP — opencode agent in --checkout mode (default build)
GEMINI_MODEL — e.g. gemini-2.5-pro

Other knobs:

PANEL_REVIEW_TIMEOUT — seconds per panelist (default 600)
PANEL_REVIEW_MAX_DIFF_BYTES — abort if the inline diff exceeds this (default 200000, ~200KB). Big rename / refactor PRs blow past this; just bump it (e.g. PANEL_REVIEW_MAX_DIFF_BYTES=1000000) when prompted. Each panelist embeds the full diff in its prompt, so the cap exists to protect context windows and cost — not to gate "real" reviews.

How it works

Builds a unified diff from the chosen target with git.
Composes one prompt by prepending prompts/review.md (or prompts/review-deep.md in --checkout mode) to the diff (and optional focus).
Spawns each panelist as a background subprocess with read-only / plan-mode flags so the reviewer can read repo files but cannot modify anything (default mode).
Streams each panelist's section to stdout the moment that panelist completes, and emits unbuffered stderr heartbeats (started / done) so a coordinator polling BashOutput sees real-time progress instead of one big dump at the end.

Deep mode (`--checkout`)

Resolves a single commit SHA the worktrees will all pin to:
- --pr N (or URL) — gh pr view "$pr_ref" returns the PR's URL, head SHA, and head nameWithOwner. Constructs the head clone URL using the PR URL's own host (so GitHub Enterprise works), then git fetch once from that URL by SHA. One network round-trip; subsequent worktrees are local-only.
- --base BRANCH — git rev-parse HEAD.
- --commit SHA — uses the SHA directly.
Materializes one detached worktree per panelist at $OUT_DIR/worktree-<name>. Each worktree is independent — test runners can write to their own node_modules/, target/, .next/ without racing siblings (CI-matrix style).
Each panelist's CWD is its own worktree. Per-panelist permission flags swap to their most permissive non-interactive equivalents:
- codex: --sandbox workspace-write
- claude: --permission-mode bypassPermissions
- opencode: --agent build --dangerously-skip-permissions
- gemini: --approval-mode yolo
A trap … EXIT registered before the worktree-creation loop iterates WORKTREES and runs git worktree remove --force on each. Cleans up however many worktrees were added before any failure or signal. Captured stdout/stderr/diff/prompt files in the parent OUT_DIR are kept for postmortem.
The deep-mode prompt explicitly forbids destructive network actions (push, PR/issue writes, package publishes) since the sandbox flags no longer enforce that. This is a softer guard than read-only mode — treat panelists as untrusted code execution against your dev box.

Notes

Each panelist is a fresh process — that's the "no prior context" guarantee. They will still pick up the project's AGENTS.md/CLAUDE.md, which is intentional: those encode project conventions worth respecting in a review.
The prompt explicitly forbids GitHub/Slack/Linear writes. In default mode, panelists are run in read-only / plan mode at the CLI level as a second line of defense; in --checkout mode, that second line is gone and the prompt is the only guard against destructive actions.
--checkout panelists share the parent repo's .git object database via git worktree. A stray git push from a panelist would publish from your worktree, even though the worktree itself is throwaway. The deep-mode prompt explicitly forbids this; treat the prompt as a firewall, not a sandbox.
If a panelist times out or errors, the script keeps the others' output and exits 2. Surface the failure in your synthesized report rather than silently dropping it.
The combined output references a tempdir like /tmp/panel-review-XXXXXX/ — re-read any panelist's raw output from there if you need more detail than the inline excerpt.

name

panel-review

description

panel-review

When to use

User says "panel review", types /panel-review, asks for "second opinions on this change", "get a panel to review this", "multi-agent review", or similar.
User wants reviews from agents that don't share this conversation's biases or context.

When not to use:

User wants you (this session) to review code → use the project's code-reviewer agent.
User wants a single second opinion → spawn one external CLI directly, no need to fan out.

Steps

Pick a target. Default to --uncommitted. Map the user's phrasing to a flag:
- "PR 27" / "pr #27" / "/panel-review pr 27" / a github.com/<owner>/<repo>/pull/N URL → --pr <N or URL> (requires the gh CLI).
- "vs main" / "against develop" without a PR number → --base <branch>.
- Specific SHA → --commit <sha>.
- Otherwise → --uncommitted. Ask only if the intent is genuinely ambiguous.
Pick panelists. Default: every supported CLI on PATH (codex, claude, opencode, gemini). The user may name a subset.
Capture optional focus. If the user gave context ("look closely at the auth changes"), pass --focus.
Decide if this needs deep mode. Default is read-only fan-out — fast and safe. Pass --checkout if the user wants panelists to actually run tests, grep callers in the PR's checkout, and chase downstream effects (asks like "really dig in", "run the tests", "deep review", "check what this breaks downstream"). --checkout only works with --pr / --base / --commit, not --uncommitted/--staged, and it gives panelists write/exec access in a throwaway worktree — strictly less safe than the default. Surface that tradeoff briefly to the user before opting in if they didn't ask for it explicitly.
Run the script (path: skills/panel-review/panel-review.sh). You MUST launch it as a background Bash (Bash tool with run_in_background: true) and poll progress with BashOutput on the returned bash_id every 10 seconds until every panelist has emitted its done (exit N) heartbeat.

This skill explicitly overrides the default harness guidance that says "do not poll background tasks — you'll be notified when they complete." That guidance is wrong for this workflow: the heartbeats and per-section streaming exist precisely so the coordinator can give the user live progress, and waiting for the single completion notification defeats that. Poll every 10 seconds. Do not back off to 30s/60s/90s — those long sleeps are the symptom of obeying the wrong guidance.

Do NOT launch the script via the Agent tool / TaskCreate / any subagent mechanism. Subagents run in a separate Claude context and there is no streaming-output API for in-flight subagents — the only thing the parent sees is the subagent's final response when it terminates. The script's stderr heartbeats become invisible, the per-section streaming is useless, and progress polling silently degrades into hacks like sleep 90 && grep panel-review: tasks/<id>.output | tail against the subagent's transcript file. If you catch yourself reaching for the Agent tool here, stop and use background Bash instead.

Do NOT poll by shelling out to sleep N && grep against any output file. BashOutput is the only correct progress-polling mechanism for this script — it returns new stdout/stderr since the last call, including the stderr heartbeats, with no parsing required.

Reason all of this matters: panelists run in parallel, but Codex is slow and dominates wall clock. Without backgrounded Bash + BashOutput, the call blocks silently for minutes and the user sees nothing.

If you absolutely cannot use run_in_background (rare — usually only when the harness lacks BashOutput), run it in the foreground and pass timeout: 600000 (10 min) since the default 2-minute Bash timeout will kill the call before Codex returns. You will lose live progress in this mode; warn the user.
Set up live progress UX. Right before (or right after) launching the script:
- Call TodoWrite with one todo per chosen panelist (Review: codex, Review: claude, …) plus a final Synthesize findings todo. Initial statuses: the first panelist's in_progress, the rest pending. (Some harnesses expose this as TaskCreate / TaskUpdate instead — use whichever todo-list tool your environment provides.)
- Each BashOutput poll: scan stderr for panel-review: <name> started and panel-review: <name> done (exit N). On a started, set that panelist's todo to in_progress (re-call TodoWrite with the updated list). On a done, set it to completed, post a single-line user-visible status (✓ <name> done — N findings, top severity <SEV> or ✓ <name> — NO_FINDINGS / ✗ <name> failed (exit N)), and immediately post that panelist's full ## <name> (exit N) section to chat as soon as it appears in stdout. Do not wait until every panelist is done — surfacing each section the moment it lands gives the user actionable findings 5–10 minutes before synthesis is possible. The synthesis step still adds value by deduping/ranking across panelists; it is not a substitute for the individual sections.
- After the last done heartbeat, set Synthesize findings to in_progress, proceed to steps 7–8, then mark it completed.
Read the script's combined output — it prints one section per panelist with their raw findings, plus a tempdir path containing each panelist's stdout/stderr. Wait for all panelists to finish before synthesizing; partial output is fine to show the user during the wait, but consensus / disagreement analysis needs every panelist's verdict.
Synthesize the findings in your reply to the user:
- Consensus — issues raised by 2+ panelists, deduplicated. List file:line + the core problem and a suggested fix.
- Unique findings — per panelist, only the findings no one else mentioned that still pass the "would a competent reviewer ask for this change" bar.
- Action list — must-fix → should-fix → optional polish.
- Disagreements — if panelists contradict each other, surface that explicitly rather than picking a side.
Don't paraphrase or invent. Surface what the panelists actually said. If a panelist returned NO_FINDINGS, note it; don't drop the panelist from the report.

Usage

bash skills/panel-review/panel-review.sh [target] [options]

Targets (pick one, default --uncommitted):

--uncommitted — staged + unstaged changes
--staged — staged only
--base BRANCH — BRANCH...HEAD
--commit SHA — a single commit
--pr NUMBER — a GitHub PR (also accepts a full PR URL); requires gh CLI

Options:

--focus TEXT — extra context for the reviewers
--panelist NAME — repeatable; one of codex, claude, opencode, gemini
--out-dir DIR — where to capture outputs (default: mktemp -d)
--timeout SECS — per-panelist timeout (default 600)
--checkout — deep mode: materialize the target ref into a throwaway git worktree and run panelists from inside it with write/exec permissions. Required for --pr / --base / --commit only. Strictly less safe than the default.

Examples:

# All available panelists, uncommitted changes
bash skills/panel-review/panel-review.sh

# Codex + Claude only, against main
bash skills/panel-review/panel-review.sh --base main --panelist codex --panelist claude

# With focus
bash skills/panel-review/panel-review.sh --uncommitted --focus "session-token handling"

# A GitHub PR
bash skills/panel-review/panel-review.sh --pr 27
bash skills/panel-review/panel-review.sh --pr https://github.com/owner/repo/pull/27

# Deep mode: panelists run tests + grep callers inside a real PR checkout.
# Each panelist gets write/exec access to a throwaway worktree; the worktree
# is git-worktree-removed on exit.
bash skills/panel-review/panel-review.sh --pr 27 --checkout
bash skills/panel-review/panel-review.sh --base main --checkout --panelist codex

Per-panelist tuning (env vars)

Pass through model selection without code changes:

CODEX_MODEL — e.g. gpt-5
CLAUDE_MODEL — e.g. opus, sonnet, full id like claude-sonnet-4-6
OPENCODE_MODEL — e.g. qwen/qwen-3.6 (run opencode models to list)
OPENCODE_AGENT — opencode agent in default (read-only) mode (default plan)
OPENCODE_AGENT_DEEP — opencode agent in --checkout mode (default build)
GEMINI_MODEL — e.g. gemini-2.5-pro

Other knobs:

PANEL_REVIEW_TIMEOUT — seconds per panelist (default 600)
PANEL_REVIEW_MAX_DIFF_BYTES — abort if the inline diff exceeds this (default 200000, ~200KB). Big rename / refactor PRs blow past this; just bump it (e.g. PANEL_REVIEW_MAX_DIFF_BYTES=1000000) when prompted. Each panelist embeds the full diff in its prompt, so the cap exists to protect context windows and cost — not to gate "real" reviews.

How it works

Builds a unified diff from the chosen target with git.
Composes one prompt by prepending prompts/review.md (or prompts/review-deep.md in --checkout mode) to the diff (and optional focus).
Spawns each panelist as a background subprocess with read-only / plan-mode flags so the reviewer can read repo files but cannot modify anything (default mode).
Streams each panelist's section to stdout the moment that panelist completes, and emits unbuffered stderr heartbeats (started / done) so a coordinator polling BashOutput sees real-time progress instead of one big dump at the end.

Deep mode (`--checkout`)

Resolves a single commit SHA the worktrees will all pin to:
- --pr N (or URL) — gh pr view "$pr_ref" returns the PR's URL, head SHA, and head nameWithOwner. Constructs the head clone URL using the PR URL's own host (so GitHub Enterprise works), then git fetch once from that URL by SHA. One network round-trip; subsequent worktrees are local-only.
- --base BRANCH — git rev-parse HEAD.
- --commit SHA — uses the SHA directly.
Materializes one detached worktree per panelist at $OUT_DIR/worktree-<name>. Each worktree is independent — test runners can write to their own node_modules/, target/, .next/ without racing siblings (CI-matrix style).
Each panelist's CWD is its own worktree. Per-panelist permission flags swap to their most permissive non-interactive equivalents:
- codex: --sandbox workspace-write
- claude: --permission-mode bypassPermissions
- opencode: --agent build --dangerously-skip-permissions
- gemini: --approval-mode yolo
A trap … EXIT registered before the worktree-creation loop iterates WORKTREES and runs git worktree remove --force on each. Cleans up however many worktrees were added before any failure or signal. Captured stdout/stderr/diff/prompt files in the parent OUT_DIR are kept for postmortem.
The deep-mode prompt explicitly forbids destructive network actions (push, PR/issue writes, package publishes) since the sandbox flags no longer enforce that. This is a softer guard than read-only mode — treat panelists as untrusted code execution against your dev box.

Notes

Each panelist is a fresh process — that's the "no prior context" guarantee. They will still pick up the project's AGENTS.md/CLAUDE.md, which is intentional: those encode project conventions worth respecting in a review.
The prompt explicitly forbids GitHub/Slack/Linear writes. In default mode, panelists are run in read-only / plan mode at the CLI level as a second line of defense; in --checkout mode, that second line is gone and the prompt is the only guard against destructive actions.
--checkout panelists share the parent repo's .git object database via git worktree. A stray git push from a panelist would publish from your worktree, even though the worktree itself is throwaway. The deep-mode prompt explicitly forbids this; treat the prompt as a firewall, not a sandbox.
If a panelist times out or errors, the script keeps the others' output and exits 2. Surface the failure in your synthesized report rather than silently dropping it.
The combined output references a tempdir like /tmp/panel-review-XXXXXX/ — re-read any panelist's raw output from there if you need more detail than the inline excerpt.

panel-review

panel-review

When to use

Steps

Usage

Per-panelist tuning (env vars)

How it works

Deep mode (--checkout)

Notes

panel-review

When to use

Steps

Usage

Per-panelist tuning (env vars)

How it works

Deep mode (--checkout)

Notes

Deep mode (`--checkout`)

Deep mode (`--checkout`)