Run any Skill in Manus with one click

$pwd:

deep-review-task

Name: Deep Review Task
Author: harbor-framework

// Run the sandboxed deep-review tool against a benchmark task PR — launches Claude Code in Docker with pre-fetched PR artifacts and writes review-summary.md / issues-found.md

Run Skill in Manus

$ git log --oneline --stat

stars:208

forks:260

updated:May 16, 2026 at 19:41

SKILL.md

readonly

name	deep-review-task
description	Run the sandboxed deep-review tool against a benchmark task PR — launches Claude Code in Docker with pre-fetched PR artifacts and writes review-summary.md / issues-found.md
allowed-tools	Bash
argument-hint	<pr-url> [--review]

Run the tools/deep-review sandboxed reviewer against a benchmark task PR. The tool spins up Claude Code inside a locked-down Docker container, pre-fetches all PR artifacts, and produces a full GUIDE.md-style review.

Prerequisites: Docker Desktop running, gh CLI authenticated, and either claude CLI logged in or ANTHROPIC_API_KEY set in tools/deep-review/.env. macOS or Linux/WSL.

Phase 0: Parse arguments

$ARGUMENTS is the PR URL, optionally followed by --review to auto-post a REQUEST_CHANGES review when the run finishes.

If no PR URL is given, stop and ask the user for one. Do not invent a URL.

Phase 1: Locate the tool

Find the repo root and verify tools/deep-review/run.sh exists:

REPO_ROOT=$(git rev-parse --show-toplevel)
RUN_SH="$REPO_ROOT/tools/deep-review/run.sh"
test -x "$RUN_SH" || { echo "deep-review not found at $RUN_SH — pull latest main"; exit 1; }

If missing, tell the user to git pull origin main (the tool landed in PR #605, commit 29b1f353) and stop.

Phase 2: Run it

Invoke run.sh directly with the PR URL. The default built-in prompt drives the full GUIDE.md review end-to-end — do not pass a custom prompt unless the user asked for one. Stream output live so the user sees thinking, tool calls, and the final cost line.

cd "$REPO_ROOT/tools/deep-review"
./run.sh --pr "<pr-url>"         # or add --review to auto-post

Pass --review only if the user included it in $ARGUMENTS or explicitly asked. Posting to GitHub is visible to others — don't decide on your own.

Phase 3: Surface the outputs

When the run completes, the task directory is tools/deep-review/tasks/<owner>/<repo>/pr-<N>/. Tell the user the paths to:

review-summary.md — full review
issues-found.md — auto-extracted ## Issues Found section
natural-difficulty-extensions.md — if the agent populated that section

Offer to open review-summary.md with open -g <path> so the user can read it in their default viewer.

If --review was used, the final Review posted: <url> line from run.sh is the posted review. Surface that URL to the user.

Notes

Concurrent runs against different PRs work — each PR has its own namespaced directory.
Re-running the same PR refreshes pre-fetched inputs in place; prior review-summary.md is overwritten by the agent.
Custom prompts (advanced): ./run.sh "your prompt" --pr <url> — only use this if the user asked for a narrower focus.
For configuration (model, effort, API key), see tools/deep-review/README.md.

related-skills.json

same repository

convert-separate-verifier.md

from "harbor-framework/terminal-bench-3"

Convert a TB3 task from Harbor's shared verifier mode (default) to separate verifier mode. Use when the user asks to "convert this task to separate verifier", "make the verifier run in its own container", or asks about Harbor's separate-verifier environment for a specific task.

2026-05-30208

trial-costs.md

from "harbor-framework/terminal-bench-3"

Sum agent-trial spend ($) from TB3 PR comments and Modal compute spend. Reports totals by kind (/run vs /cheat) and provider (anthropic, openai, ...), top-spender PRs, per-task latest trial cost, and daily Modal billing. Use when the user asks how much was spent on agent trials, cost breakdown by provider, PR-level spend, or Modal usage.

2026-05-16208

tb3-status.md

from "harbor-framework/terminal-bench-3"

Generate TB3 review status report and open in browser

2026-05-13208

update-rubric.md

from "harbor-framework/terminal-bench-3"

Propose new rubric criteria based on review findings — creates a branch and PR

2026-04-10208

package.json

"author": "harbor-framework"

"repository": "harbor-framework/terminal-bench-3"

View GitHub Repository View Creator Repositories

$ install --global

$ download --local

Run Skill in Manus

$ useful --forSOC

Software Quality Assurance Analysts and TestersComputer and Mathematical Occupations15-1253L4

name	deep-review-task
description	Run the sandboxed deep-review tool against a benchmark task PR — launches Claude Code in Docker with pre-fetched PR artifacts and writes review-summary.md / issues-found.md
allowed-tools	Bash
argument-hint	<pr-url> [--review]

Prerequisites: Docker Desktop running, gh CLI authenticated, and either claude CLI logged in or ANTHROPIC_API_KEY set in tools/deep-review/.env. macOS or Linux/WSL.

Phase 0: Parse arguments

$ARGUMENTS is the PR URL, optionally followed by --review to auto-post a REQUEST_CHANGES review when the run finishes.

If no PR URL is given, stop and ask the user for one. Do not invent a URL.

Phase 1: Locate the tool

Find the repo root and verify tools/deep-review/run.sh exists:

REPO_ROOT=$(git rev-parse --show-toplevel)
RUN_SH="$REPO_ROOT/tools/deep-review/run.sh"
test -x "$RUN_SH" || { echo "deep-review not found at $RUN_SH — pull latest main"; exit 1; }

If missing, tell the user to git pull origin main (the tool landed in PR #605, commit 29b1f353) and stop.

Phase 2: Run it

cd "$REPO_ROOT/tools/deep-review"
./run.sh --pr "<pr-url>"         # or add --review to auto-post

Pass --review only if the user included it in $ARGUMENTS or explicitly asked. Posting to GitHub is visible to others — don't decide on your own.

Phase 3: Surface the outputs

When the run completes, the task directory is tools/deep-review/tasks/<owner>/<repo>/pr-<N>/. Tell the user the paths to:

review-summary.md — full review
issues-found.md — auto-extracted ## Issues Found section
natural-difficulty-extensions.md — if the agent populated that section

Offer to open review-summary.md with open -g <path> so the user can read it in their default viewer.

If --review was used, the final Review posted: <url> line from run.sh is the posted review. Surface that URL to the user.

Notes

Concurrent runs against different PRs work — each PR has its own namespaced directory.
Re-running the same PR refreshes pre-fetched inputs in place; prior review-summary.md is overwritten by the agent.
Custom prompts (advanced): ./run.sh "your prompt" --pr <url> — only use this if the user asked for a narrower focus.
For configuration (model, effort, API key), see tools/deep-review/README.md.

deep-review-task

Phase 0: Parse arguments

Phase 1: Locate the tool

Phase 2: Run it

Phase 3: Surface the outputs

Notes

More from this repository

More from this repository

Phase 0: Parse arguments

Phase 1: Locate the tool

Phase 2: Run it

Phase 3: Surface the outputs

Notes