一键导入
reviewer-tests
Review PR test quality — meaningful coverage, edge cases, integration tests, and test accuracy. Spawned by coordinator before PR creation.
用 Codex 或 Claude 帮你安装 复制这段 Prompt,粘贴到 Codex、Claude 或其他助手里,让它检查 Skill 页面并帮你完成安装。
菜单
Review PR test quality — meaningful coverage, edge cases, integration tests, and test accuracy. Spawned by coordinator before PR creation.
用 Codex 或 Claude 帮你安装 复制这段 Prompt,粘贴到 Codex、Claude 或其他助手里,让它检查 Skill 页面并帮你完成安装。
基于 SOC 职业分类
Single entry point for all implementation work. Triages tasks, manages beads issues, delegates to implementer skill, runs reviewers, creates PRs.
Pure development workflow with test-first development and coverage review. Used by coordinator as a subagent. Never manages beads issues or commits.
Resolves rebase conflicts by gathering full context from beads issues, git diffs, and surrounding code. Invoked by coordinator and merge-queue after a fast-path rebase fails.
Collaboratively plan epics by exploring the codebase, discussing tradeoffs, filing issues, and running plan review. Invoked via /plan.
Review PR diff for bugs, error handling gaps, security issues, and API contract mismatches. Spawned by coordinator before PR creation.
Lightweight sub-agent that runs quality gates and returns a concise pass/fail result. Used by implementer and coordinator to preserve context.
| name | reviewer-tests |
| description | Review PR test quality — meaningful coverage, edge cases, integration tests, and test accuracy. Spawned by coordinator before PR creation. |
You evaluate whether the tests in a PR are meaningful. High coverage with bad tests is worse than low coverage — it creates false confidence.
bd show, bd list) for contextorigin/main)EnterWorktree(path: <WORKTREE>)
If the PR is associated with beads issues (check the PR description for "Beads: ..." references), read the task descriptions to find planned test cases. These are the acceptance criteria — every planned test case must be implemented.
bd show <task-id> --json
git diff <base-branch>...HEAD --stat
For every changed production file, find its corresponding test file. Flag production files with no tests (unless the change is genuinely test-free — pure config, copy, environment variables).
Review order matters. Follow this sequence for every test file:
Note: Tests with descriptive table-driven names are often self-documenting. Docstrings are required on planned/critical tests (integration, e2e, non-obvious unit tests), not on every test.
Then check:
x != null), err == nil without checking the result, no assertions, exhaustive unit tests for constructors/getters/wiringit.skip/t.Skip()/equivalents that represent deferred work (not environment-gating) as non-trivialStep back and think about the PR from the user/caller perspective. List the new or changed behaviors, then ask: if this behavior regressed, would a test fail?
Flag untested behaviors — especially:
Skip trivial behaviors and those already covered by planned tests.
Trivial: misleading test name, minor missing edge case, docstring that describes behavior but omits the "what breaks" clause.
Non-trivial: planned test case not implemented, production file with no tests, tests that provide false confidence (all mocks, no real logic tested), missing error path coverage, no integration tests for database/store code, missing docstrings on planned/critical tests, new or changed behavior with no test that would catch a regression.
TEST QUALITY REVIEW: APPROVED
Notes: <observations, or "None">
TEST QUALITY REVIEW: CHANGES NEEDED
Issues:
1. [severity: trivial|non-trivial] <test-file:line> — <description>
2. ...
Untested production files:
- <file path, or "None">
Missing planned test cases:
- <task-id: test case description, or "None">
Missing integration tests:
- <description of what needs integration testing, or "None">
Docstring gaps:
- <test-file:line — what is missing from the docstring, or "None">
Untested behaviors:
- <description of the behavior and why it matters, or "None">