Ejecuta cualquier Skill en Manus
con un clic

Ejecuta cualquier Skill en Manus con un clic

review-follow-up

Estrellas14

Forks1

Actualizado1 de julio de 2026, 04:14

Follow up on a PR/MR you reviewed: assess whether the author's fixes actually grasped each concern, find silently resolved or mechanically-fixed threads, and get a graded re-review call. Use after leaving review feedback to see if the author acted on it.

Instalación

Instalar con Codex o Claude Copia este prompt, pégalo en Codex, Claude u otro asistente, y deja que revise la página de la skill y la instale por ti.

Ejecutar en Manus

Fuente

bendrucker

bendrucker/claude

Abrir repositorio de GitHub Ver repositorios del creador

Descarga

Ejecutar en Manus

Explorador de archivos

3 archivos

SKILL.md

readonly

Más de este repositorio

mismo repositorio

claude-code-session

bendrucker/claude

Query Claude Code session history via a DuckDB index over `~/.claude/projects/`. Use when asked about Claude Code activity ("how many tokens today?", "what did I work on this week?") or instead of reading, grepping, or jq-ing session transcripts. Not for codebase search, git log queries, or arbitrary databases.

2026-07-0314

claude-code-skill

bendrucker/claude

Creating and optimizing Claude Code Skills including activation patterns, content structure, and development workflows. Use when creating new skills, converting memory files to skills, debugging skill activation, or understanding skill architecture and best practices.

2026-07-0314

things-jxa

bendrucker/claude

Read and query Things 3 data (lists, todos, projects, tags, logbook). Not for writes. Use things:url to create/update, things:inbox for quick captures.

2026-07-0214

improve-claude-code

bendrucker/claude

Triage and batch-implement Claude-tagged Things todos as PRs for the claude config repo, discover improvement candidates from session history, or watch open PRs to implement review feedback and close shipped todos. Use when the user wants to work on their Claude Code improvement backlog, process Things todos tagged claude-code, batch-implement configuration changes, mine session history for grounded config-change candidates (Discover mode), or watch this skill's open PRs for review feedback and merges (Watch mode).

2026-07-0214

github-pr-comments

bendrucker/claude

Fetch, reply to, and resolve review threads on a GitHub pull request. Use when checking what review feedback needs addressing, whether threads are resolved, replying to review comments, or clearing AI-reviewer threads after acting on them.

2026-07-0214

review-inbox

bendrucker/claude

Live tmux inbox for reviewing inbound pull requests across GitHub and GitLab. Use when reviewing multiple PRs, checking the review queue, batch reviews, or managing your review inbox. Pass --queue to spawn an already-ordered queue.

2026-07-0214

name	review:follow-up
description	Follow up on a PR/MR you reviewed: assess whether the author's fixes actually grasped each concern, find silently resolved or mechanically-fixed threads, and get a graded re-review call. Use after leaving review feedback to see if the author acted on it.
argument-hint	[pr-url] [--on-behalf-of <reviewer>]
allowed-tools	["Bash(gh:)","Bash(glab:)","Bash(git:*)","Task","Monitor"]

Review Follow-Up

Follow up on my review of: $ARGUMENTS

$ARGUMENTS is optional. When it carries a PR/MR URL or identifier, follow up on that. When it's empty, resolve the target from the current branch's open PR/MR before anything else (see Resolve Target). Only ask me to paste a URL when that resolution genuinely fails.

A --on-behalf-of <reviewer> flag selects a trust-gated, unattended loop that auto-re-approves in <reviewer>'s name as fixes land. See references/on-behalf-of.md for the procedure. Without the flag, follow-up is interactive.

You are the reviewer. The question is not "did the author reply?" but "did the fix actually grasp each concern, or go through the motions?" Status is a signal. The code is the verdict.

All diff and file reading happens in Explore sub-agents. Your main session holds only the structured verdicts they return, never raw diffs, keeping my development context free.

Context

Remote URL: !git remote get-url origin 2>/dev/null || echo "unavailable"
Branch: !git branch --show-current 2>/dev/null || echo "unavailable"

Workflow

Resolve Target

Settle on a PR/MR URL before any other step; everything downstream keys off it.

If $ARGUMENTS carries a URL or identifier, use it. If it's empty, detect the current branch's open PR/MR. Pick the platform from git remote get-url origin (host contains github.com => GitHub, else the GitLab instance that remote points at), then:

GitHub: gh pr view --json url,state --jq '.url'
GitLab: glab mr view --output json | jq -r '.web_url'

Both resolve the current branch with no positional argument. Feed the URL into Detect Platform unchanged.

Ask me for a URL only after detection genuinely comes up empty, and report what you tried: the branch (git branch --show-current) and the origin you resolved from. Edge cases:

Detached HEAD (no branch) => skip detection, ask.
Multiple remotes => use origin; I can pass a URL if that's wrong.
Closed or merged => still resolves; surface the state so I can confirm before proceeding.

Detect Platform

Parse the URL from $ARGUMENTS to determine the platform. GitHub URLs contain github.com, GitLab URLs contain the GitLab instance hostname.

GitLab

Load the gitlab:api and gitlab:merge-request skills for API patterns and MR tooling.

Sync to the Latest Pushed State

Your assessment must reflect what the author actually pushed, not a stale local checkout.

The worktree-behind-origin trap: judging a local worktree that is N commits behind origin makes clean fixes look unaddressed. Never assess local state without syncing.

Prefer the remote-of-record diff. glab and gh API diffs always reflect the server, so favor them over the local worktree.
If a sub-agent must read the local worktree, it has to git fetch first and compare against origin/<branch>, never the local checkout.

Establish the review baseline: the commit/version at which you last reviewed, plus current head. The fix assessment diffs that range, not the whole MR.

GitLab: the gitlab:merge-request skill exposes MR versions and diff_refs to bound the range from your last-reviewed version to head.
GitHub: use your review's commit SHA to head.

Fetch Threads

Gather all resolvable discussion threads that I started (both resolved and unresolved). Retain each thread's original comment body and location. The sub-agents need the concern's intent, not its status.

GitHub

Use the github:pr-comments skill's script with --role reviewer --include-resolved to get every thread you started, resolved and unresolved, each with its full comment chain so silent resolves are visible. Do not pass --since. Follow-up needs the whole history.

If a query ever needs hand-authoring, write it to a file with the Write tool and pass gh api graphql -F query=@file. Never use an inline shell heredoc. A ! in a GraphQL type marker breaks under the Bash tool's escaping.

GitLab

Use the gitlab:merge-request skill's discussions tooling to fetch the resolvable threads you started, both resolved and unresolved. You need each thread's original body, its location, and any later notes (the author's replies), so pull the full discussion payload, not just the first-note summary. That skill covers the script, username lookup, and the raw-payload caveat.

Classify Threads

Sort each thread into one of four buckets, treating the bucket as a signal, not a verdict:

Unresolved / no author reply: Author hasn't responded
Unresolved / author replied: Author responded, evaluate the reply
Resolved / with reply: Author replied and resolved
Resolved / silent: Author resolved without replying

A system-generated note (a version or diff note, not a human comment) is not a substantive reply. Don't count it as the author engaging with the concern. The fetch step's skill covers how these look on each platform.

The code-grounded verdict from the next step is the source of truth. A mismatch between status and code reality (resolved but not fixed, unresolved but fixed) is a finding.

Assess Fixes

The core step. Dispatch read-only Explore sub-agents (Task tool) to read the post-review diff and judge each fix. Group threads by the file they sit on and dispatch one Explore agent per file. When a single file carries many threads, fall back to per-thread agents. Cap parallelism at roughly 6-8 concurrent.

Give each agent:

the file path
every thread on that file: original comment body, location, and the underlying intent
the review-baseline→head diff range from the sync step

Each agent reads that file's post-review diff once and returns a compact structured verdict per thread. No raw diffs come back, only these fields:

location: file:line or thread id
concern: the original intent, one line
change: what the author actually did, code-grounded, one line (or "no change found")
verdict: Addressed | Partial | Not addressed | Dismissed | Can't tell
quality: Solid | Mechanical (satisfies the words, misses the intent) | Went further | Adopted verbatim | Workaround/band-aid | Introduced new issue | n/a (use when nothing changed; quality only applies to a fix that exists)
status_mismatch: e.g. "resolved but not actually fixed", "unresolved but fixed", or none
confidence: high | med | low
focus: boolean, does this warrant my eyes

Spell out these assessment lenses in each agent's prompt:

Does the fix address the underlying concern, or only the literal surface words?
Is it partial, fixing one call site but not the others, or handling the happy path only?
Was it dismissed? If so, is the dismissal reasonable?
Did the author go further than asked, or adopt my wording verbatim (a sign they fed my comment back into a model without engaging)?
Did the fix introduce a new issue near the change?

Quick Pass for New Issues

Have the same per-file agents flag problems the fixes introduced that aren't tied to any thread. Surface these separately from the thread verdicts.

Verify Author Claims

If the author claims they tested, validated, or ran something, check for evidence: relevant commits, test changes, MR-body checkboxes that are checked. Flag unsubstantiated claims rather than taking them at face value.

Present: Attention-Ranked Report

Lead with what needs my eyes. Don't emit a uniform full-detail row per thread.

Needs your attention

Flagged thread verdicts (focus = true), new issues, and unverified claims. Each one concise and code-grounded, with its confidence. Include any status-vs-reality mismatch.

Cleanly addressed

Collapse solid fixes to a count plus one-liners. Don't expand them.

Recommendation

A graded call:

Approve: everything addressed cleanly
Approve with comments: addressed, with minor nits you can fix in a follow-up PR
Request changes: partial, mechanical, or unaddressed concerns remain

Name the minor-nit lane explicitly: approving while requesting trivial follow-ups is an option, not a forced choice between approve and block.

Execute Actions

After I decide on next steps, use platform-appropriate commands:

GitHub

Approve: gh pr review --approve
Comment: gh pr comment
Resolve thread: gh api graphql with resolveReviewThread mutation

GitLab

Approve, resolve or unresolve threads, and comment via the gitlab:merge-request skill.

Guardrails

Check with me before submitting any review or comments.
Don't approve automatically: present the evidence, let me decide.
Flag silently resolved threads so I can decide whether to reopen or accept.
Distinguish code fixes from dismissals: a resolved thread with matching code changes is different from one resolved with no changes.
Sync before judging: never assess stale local state. Fetch and diff against origin first.
Status is a signal, the code is the verdict: a mismatch between the two is a finding, not noise.
Intent over words: distinguish a fix that meets the concern from one that mechanically satisfies its literal wording.
Behalf-of approval is private/internal only: auto-re-approval on another reviewer's behalf (references/on-behalf-of.md) is allowed only on private or internal repos, never in someone else's name on a public repo. Fail closed if visibility is undeterminable.