Ejecuta cualquier Skill en Manus
con un clic

Ejecuta cualquier Skill en Manus con un clic

review-loop

Adversarial review→triage→fix loop until a cold verifier signs off. Fans out lens-specific reviewer subagents, verifies every finding against the code (killing false positives), auto-applies confirmed fixes as fixup commits, and repeats until a fresh verifier approves. Prefers a deterministic dynamic workflow when available; falls back to in-instance Task dispatch. Use when the user types /review-loop or asks to adversarially review-and-fix a change set, branch, or commit range until clean.

Ejecutar en Manus

Estrellas19

Forks3

Actualizado8 de junio de 2026, 04:25

Fuente

Roasbeef

Roasbeef/claude-files

Abrir repositorio de GitHub Ver repositorios del creador

Comando de instalación

Descarga

Ejecutar en Manus

Útil paraSOC

Analistas de garantía de calidad de software y probadoresOcupaciones informáticas y matemáticas15-1253L4

Explorador de archivos

2 archivos

SKILL.md

readonly

Más de este repositorio

mismo repositorio

agent-browser

Roasbeef/claude-files

Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data from a page", "test this web app", "login to a site", "automate browser actions", or any task requiring programmatic web interaction.

2026-06-0819

technical-writing

Roasbeef/claude-files

Clear-writing guide distilled from Steven Pinker's "The Sense of Style." Use when writing or revising prose that must be clear to a reader — documentation, design docs, specs, explanations, essays, emails, reports, RFCs, release notes — or when asked to make writing clearer, tighter, less academic, or less jargon-laden. Activate for "make this clearer", "tighten this", "why is this hard to read", "edit this for clarity", or any prose-quality pass.

2026-06-0119

lnget

Roasbeef/claude-files

Use lnget to fetch resources from L402-protected URLs that require Lightning payments. Covers basic fetching, payment limits (max cost, max routing fee), token cache management, and Lightning backend status. Use when an HTTP request returns 402 Payment Required and a Lightning micropayment is needed, or when downloading files behind a Lightning paywall.

2026-05-2019

mutation-testing

Roasbeef/claude-files

Validates Go test suite quality through mutation testing using go-gremlins/gremlins. Mutates production code, runs the test suite against each mutant, and reports which mutants the tests fail to kill — exposing weak assertions that line coverage cannot detect. Use when evaluating test effectiveness, validating newly written tests, or improving test quality for mission-critical code (consensus, channel state, payment flows, crypto). Triggers: "mutation test", "are these tests strong", "validate test quality", "/mutation-testing".

2026-05-2019

test-refine

Roasbeef/claude-files

Refines an existing Go test suite — removes trivial/duplicate tests, strengthens weak assertions, reshapes tests around invariants, and closes branch-coverage gaps. Uses code-guided coverage and (when available) gremlins mutation-testing survivor data rather than relying on line coverage alone. Use when test quality is uneven, after a test-generation pass, before opening a PR, or as a quality gate on critical paths (consensus, channel state, payment flows). Triggers: "refine these tests", "tests are bloated", "tighten assertions", "remove trivial tests", "audit test quality", "/test-refine".

2026-05-2019

go-debug

Roasbeef/claude-files

Interactively debug Go programs in a single context using Delve (dlv) driven through tmux. Use when a bug requires runtime inspection — stepping through code, examining variables, walking goroutines, attaching to a live process, or debugging a hanging integration test — rather than just reading the source. Triggers include "step through this", "set a breakpoint", "attach to the running server", "why is this goroutine stuck", "debug this failing test".

2026-05-2019

name	review-loop
description	Adversarial review→triage→fix loop until a cold verifier signs off. Fans out lens-specific reviewer subagents, verifies every finding against the code (killing false positives), auto-applies confirmed fixes as fixup commits, and repeats until a fresh verifier approves. Prefers a deterministic dynamic workflow when available; falls back to in-instance Task dispatch. Use when the user types /review-loop or asks to adversarially review-and-fix a change set, branch, or commit range until clean.
argument-hint	[<commit-range> \| <branch> \| (default: branch vs base / uncommitted)] [--base=<branch>] [--max-iters=<N>] [--cutoff=high\|medium\|low] [--workflow\|--inline]
disable-model-invocation	true
allowed-tools	["Task","Workflow","Bash","Read","Write","Edit","MultiEdit","Grep","Glob","TodoWrite","AskUserQuestion"]

Review Loop

Run an adversarial review→triage→fix loop until a fresh cold verifier signs off. Unlike /code-review (report-only) this loop verifies every finding against the code, kills false positives, applies the confirmed fixes as fixup commits, and repeats until acceptance. All reviewing happens in subagents, so each reviewer burns its own context, not yours.

Target: $ARGUMENTS

Why this shape (and why a workflow)

This loop exists to defeat three failure modes that hit a single context window on long, adversarial tasks:

Agentic laziness — declaring a review done after partial coverage. The loop's fixed phases and convergence check force full coverage.
Self-preferential bias — grading your own findings. The triage judge and the final verifier are separate agents that never saw your reasoning.
Goal drift — losing the original constraints across turns. A design brief is passed verbatim to every agent.

Because those guarantees depend on the orchestration actually running every phase every time, the preferred execution path is a dynamic workflow (a deterministic JavaScript harness), not model-driven dispatch. The workflow encodes fan-out, triage, apply, loop, and verify as code that cannot drift or cut corners. The in-instance path below is the fallback when the Workflow tool is unavailable or the user passes --inline.

Phase 0: Scope, baseline, and design brief (always done by the main loop)

Do this in the first turn, before any dispatch, regardless of execution path.

Resolve scope into one concrete diff command and a stable description:

git branch --show-current
git show-ref --verify --quiet refs/heads/main && echo main || echo master
# Range given? use it. Branch given? <base>...<branch>.
# Else commits ahead of base? <base>..HEAD. Else uncommitted: git diff HEAD.
git diff <range> --stat ; git log <range> --oneline

Every finder and the verifier must review the same surface — record the exact diff command.

Capture a pre-flight baseline so pre-existing breakage is not blamed on the change:
```
make build 2>&1 | tail -5 ; make test 2>&1 | tail -5 ; make lint 2>&1 | tail -5
```
Note what was already red (e.g. a toolchain/lint-config issue) for the brief.
Write a design brief to .review-loop/brief.md — this is what makes triage accurate. Include: what the change does and why (approved intent); hard constraints and environment/protocol semantics reviewers can't infer from the diff; accepted tradeoffs and out-of-scope items; the pre-flight baseline.

Pick lenses from the changed files and record them in .review-loop/lenses.md. Always run the baseline adversarial panel; add specialized lenses when trigger files are present:

Lens	subagent_type	Trigger
Correctness	`code-reviewer`	always
Offensive security	`security-auditor`	always
Differential / blast radius	`general-purpose` + `differential-review` skill	always
Concurrency	`general-purpose`	goroutines, channels, mutexes, `sync`, atomics
Shell / config hardening	`general-purpose`	`*.sh`, Dockerfiles, CI YAML, hooks, settings
API safety & insecure defaults	`general-purpose` + `sharp-edges`/`insecure-defaults`	public interfaces, config, RPC/proto
Deep function analysis	`audit-context-building:function-analyzer`	crypto/auth, consensus, value-transfer
Spec compliance	`spec-to-code-compliance:spec-compliance-checker`	BIP/BOLT/protocol/spec references

mkdir -p .review-loop and track the run with TodoWrite (one item per phase, plus a per-round entry as the loop iterates).

Preferred path: dynamic workflow

When the Workflow tool is available and --inline was not passed, run the loop as a deterministic harness. The bundled script workflow/review-loop.js is a template — adapt it to the run (the chosen lens set, cutoff, and max-iters), do not assume it must run verbatim.

Invoke it via the Workflow tool, passing the Phase 0 artifacts as args:

Workflow({
  scriptPath: "<this skill dir>/workflow/review-loop.js",
  args: {
    diffCmd:   "<exact diff command>",
    base:      "<base branch>",
    brief:     "<contents of .review-loop/brief.md>",
    lenses:    [ /* the selected lens descriptors */ ],
    cutoff:    "medium",
    maxIters:  5,
  },
})

The workflow runs find→triage→apply→loop→verify and returns a structured summary (rounds, confirmed vs rejected per round, applied fixups, deferred follow-ups, verifier verdict). When it returns, the main loop does Phase 6 (finalize) below — autosquash offer and final green build — because those steps are interactive and side-effectful.

If the workflow hits maxIters without converging, it returns what remains rather than looping forever; surface that and ask how to proceed.

Fallback path: in-instance Task dispatch (`--inline` or no Workflow tool)

Run the same phases with the Task tool. This is what we ran by hand; it works but relies on the orchestrator faithfully executing each phase.

Phase 1 — dispatch finders

Launch every selected lens in one message with parallel Task calls (or run_in_background: true and collect notifications). Give each the same diff surface and the design brief, with this adversarial skeleton:

You are an ADVERSARIAL reviewer. BREAK this change, do not grade it. Only
report findings you can argue concretely from the code.
Scope (review exactly this): <diff command>
Design brief: <.review-loop/brief.md>
Your lens: <lens + specific failure modes to hunt>
For each finding return: stable id, file:line, severity
(critical/high/medium/low/info), a concrete trigger SCENARIO, and a minimal fix
sketch. A verified "not a bug" is useful signal. Raw list, no pleasantries.

Write outputs to .review-loop/round-<N>/find-<lens>.md.

Phase 2 — triage (never skip)

Spawn ONE general-purpose judge with all finder outputs + the brief + code read access. It must verify each finding against the cited lines (reject what it can't reproduce), dedup/merge, kill false positives with reasons, and classify survivors into fix-now (≥ cutoff; with a repo-style fix sketch), follow-up (deferrable; with an issue title), rejected (why). Write to .review-loop/round-<N>/triage.md. If a fix-now item contradicts the approved design, surface via AskUserQuestion before fixing.

Phase 3 — apply

For each fix-now finding in severity order: implement the minimal fix matching surrounding code; add/update tests when testable; build + relevant package tests must pass vs the Phase 0 baseline; commit as a fixup:

git add <files> ; git commit --fixup=<target-sha>

Use hunk stage for files mixing fix-now and deferred changes. Log to .review-loop/round-<N>/applied.md.

Phase 4 — loop

Increment the round, re-run Phase 1 finders on the new diff. New triage-confirmed fix-now findings → back to Phase 2/3. A clean round (zero new fix-now) → Phase 5. Stop at --max-iters and report what remains.

Phase 5 — cold acceptance verifier

Spawn ONE fresh code-reviewer that saw no prior round. Give it only the brief and the full final diff (<base>..HEAD) and ask: APPROVE, or RE-OPEN with concrete findings. APPROVE → Phase 6. RE-OPEN → feed findings into Phase 2 (subject to the same triage discipline).

Phase 6: Finalize (always done by the main loop)

Offer autosquash of the fixups into their originals:
```
hunk rebase autosquash --onto <base> --dry-run
```
Show the plan; on approval run for real. If fixups interleave with other commits on the same lines (conflict risk), instead offer a single review: commit via soft reset. Declined → leave fixups as-is.
Final verification: build + full tests + lint, green vs the Phase 0 baseline.
Summary (concise, to chat): rounds run, confirmed vs rejected per round, fixes applied (with commits), the deferred follow-up list (suggest opening issues), and the verifier verdict.

Notes

In-instance by design. Finders, triage, and verifier are subagents, so the heavy reading lives in their context, not yours. The Substrate path (/s-code-review) is the alternative when you want findings tracked in the review system / web UI; this trades that for lower context cost.
Never skip triage. Raw adversarial finders produce plausible-but-wrong findings; verify-and-reject is what makes auto-apply safe.
Cutoff discipline. Fix C/H/M in-loop; defer L/I to keep the loop converging and the diff focused. Surface deferred items, don't drop them.

review-loop

Más de este repositorio

Más de este repositorio

Review Loop

Why this shape (and why a workflow)

Phase 0: Scope, baseline, and design brief (always done by the main loop)

Preferred path: dynamic workflow

Fallback path: in-instance Task dispatch (--inline or no Workflow tool)

Phase 1 — dispatch finders

Phase 2 — triage (never skip)

Phase 3 — apply

Phase 4 — loop

Phase 5 — cold acceptance verifier

Phase 6: Finalize (always done by the main loop)

Notes

Review Loop

Why this shape (and why a workflow)

Phase 0: Scope, baseline, and design brief (always done by the main loop)

Preferred path: dynamic workflow

Fallback path: in-instance Task dispatch (--inline or no Workflow tool)

Phase 1 — dispatch finders

Phase 2 — triage (never skip)

Phase 3 — apply

Phase 4 — loop

Phase 5 — cold acceptance verifier

Phase 6: Finalize (always done by the main loop)

Notes

Fallback path: in-instance Task dispatch (`--inline` or no Workflow tool)

Fallback path: in-instance Task dispatch (`--inline` or no Workflow tool)