تشغيل أي مهارة في Manus بنقرة واحدة

peer-review

النجوم٢

التفرعات٥

آخر تحديث١٢ يونيو ٢٠٢٦ في ٠٥:٢٤

Peer review of research funding applications and academic submissions. Scheme-agnostic — fetches current criteria from the relevant handbook each round, since weights and language change. Covers Detailed Assessor and College-of-Experts / General Assessor roles, plus collegial draft review.

التثبيت

التثبيت باستخدام Codex أو Claude انسخ هذا Prompt والصقه في Codex أو Claude أو مساعد آخر ليراجع صفحة Skill ويثبّتها لك.

تشغيل في Manus

المصدر

nicsuzor

nicsuzor/academicOps

فتح مستودع GitHub عرض مستودعات المنشئ

تنزيل

تشغيل في Manus

المهن ذات الصلةSOC

استنادا إلى تصنيف SOC المهني

مطوّرو البرمجياتمهن الحاسوب والرياضيات·SOC 15-1252

مستكشف الملفات

10 ملفات

SKILL.md

readonly

المزيد من هذا المستودع

نفس المستودع

daily

nicsuzor/academicOps

Daily note lifecycle — compose and maintain a factual daily note. Reports the state of the day; does not prioritise or recommend. SSoT for daily note structure.

2026-06-242

remember

nicsuzor/academicOps

Unified memory skill: immediate mode (/remember) persists knowledge via PKB MCP; maintenance mode (/sleep, GHA cron) runs periodic consolidation — transcript mining, knowledge synthesis, data quality, brain sync.

2026-06-242

strategic-review

nicsuzor/academicOps

Unified multi-agent review of any artifact — a document, plan, proposal, or pull request. The calling agent deploys rbg, pauli, and marsha in parallel, then @james reconciles their findings into one verdict. Pass `comment` and/or `fix` to write the result back to the review surface. Use `--critic` for a fast pauli-only pre-hoc critique.

2026-06-242

craft

nicsuzor/academicOps

Instruction quality gate — reviews agent instructions (task bodies, workflow steps, skill procedures, self-test protocols) for shallow-execution vulnerabilities before deployment. Two modes: author (pre-hoc review) and audit (trace a failure back to the instruction gap). The bar is excellence, not compliance.

2026-06-242

planner

nicsuzor/academicOps

Strategic planning agent — graph structure ownership, task decomposition, knowledge-building, and PKM maintenance. Works on WHAT exists and HOW it relates.

2026-06-242

survey

nicsuzor/academicOps

Survey a corpus, classify, and dispatch outputs. Three modes: retro (transcript review → issues), trend (longitudinal performance analysis), sweep (GitHub issue triage → fix-epics). Delegates execution to pauli (retro/trend) or jr (sweep) to keep main context clean.

2026-06-242

name	peer-review
type	skill
description	Peer review of research funding applications and academic submissions. Scheme-agnostic — fetches current criteria from the relevant handbook each round, since weights and language change. Covers Detailed Assessor and College-of-Experts / General Assessor roles, plus collegial draft review.
category	instruction
triggers	["review this grant","review this application","future fellowship","DECRA","ARC assessment","ERC review","SNSF review","fellowship review","detailed assessor","general assessor","college of experts","process reviewer comments"]
modifies_files	true
needs_task	true
mode	workflow
domain	["academic"]
allowed-tools	Read,Write,Edit,Grep,Glob,Bash
version	3.0.0
permalink	skill-peer-review

Peer Review Skill

Prepare peer reviews of research funding applications or academic submissions, matching the current round's scheme criteria and producing evidence-based, signable feedback. The agent prepares and verifies; the academic owns the scores, the net call, and the final submitted text.

What this skill is, in one paragraph

A review is not a linear pipeline that ends in a self-check. It is an adaptive loop of five stages whose quality comes from two moves a naive draft-then-submit flow lacks: adversarial independent verification of every claim, and a voice-match + whole-set de-template pass that makes the prose signable and un-formulaic. Its failures happen at the seams between stages — a polish pass that quietly promotes a paraphrase into a fabricated quote; a voice rewrite that drops a verified correction. So the loop gates its regeneration boundaries: "verified" attaches to a committed artifact, not an idea, and any regeneration re-enters verification.

Core Rules

Evidence-Based: Every claim must cite the application (page, section, line, or quote).
Absent Evidence Is Feedback: A missing required element is a scoreable weakness, not an omission to skip silently — but say where you looked (a documented grep with synonyms), never assert an absence you didn't search for.
Fetch Round Criteria: Fetch the current scheme guidelines (ARC, ERC, SNSF, NHMRC) for this round. Weights, bands, and character minimums change; do not reuse prior criteria.
Human Owns Judgment: Scores, top-line read, net call, and submitted text are the academic's. The agent prepares evidence-mapped drafts with scores left blank. This is how funding-body GenAI policies (e.g. ARC's ban on AI-generated assessor text) are satisfied structurally — not by refusing to help draft.
No Integrity Allegations in Comments: Route suspected research-integrity breaches to the scheme's integrity office separately (see [[platform-instructions]]).
One Living Draft in Git: One draft per review, versioned by git history — no -v2 / -v3 files, no carried-forward appendix. Cut critique lives in the history, not in a growing reserve block (see [[review-template]]).
Every Worked Example Verbatim-True: Any example in these skill files that quotes an application must be a real, grep-checked string. A fabricated "illustrative" quote in the doctrine is how the doctrine teaches the bug. Generic placeholders ("{quoted phrase}") are fine; invented specifics are not.

Execution model — serial core, parallel optional

The loop is written so one agent can run it alone, in series, wearing each stage as a hat (read the stage's reference file → do the stage → move on). That is the portable default; it needs no fleet infrastructure.

When you have many reviews in a round and the infrastructure to fan out, two stages parallelise — in opposite directions:

VERIFY silos: one independent, contextless verifier per application (isolation prevents cross-contamination). Prefer a fresh sub-agent for VERIFY even in serial mode — a single agent verifying its own draft is a weaker adversary than a cold reader.
DE-TEMPLATE is whole-set: a single pass over the entire round's drafts at once (fingerprints only exist across documents).

Dispatch constraint. Fan-out is a privilege of a top-level / lead session. If this skill is itself running inside a leaf sub-agent, it cannot fan out a verifier panel — fall back to the serial hat-switch, or escalate to the caller that a panel needs a top-level session (see specs/review-dispatch-topology.md).

The five-stage loop

The number of passes is not fixed (informal iteration): a clean application may need one VERIFY cycle; a contested one several. Exit a stage when its checks pass, not after a prescribed count.

Stage 0 — Intake & setup (once per round)

Capture the assignment: scheme/round, application ID, candidate, institution, title, role (Detailed vs General Assessor / CoE vs collegial — see [[reviewer-roles]]), deadline.
CoI scan, once. Scan participant lists and references for institutional, personal, collaborator, supervisory, or financial relationships → declare → carry the result as a one-line header flag, then drop it. Do not re-litigate CoI every turn. The single residual that must recur: "confirm against your own records" — the scan cannot see private collaboration history (see [[reviewer-roles]]).
Fetch this round's criteria, weights, bands, and character minimums fresh from the handbook (PKB or the scheme site). Do not reuse last round's.
Assemble materials under ${ACA_DATA:-~/brain}/reviews/{scheme}/{appid}/. Convert: pdftotext -layout source.pdf source.txt. Honest tooling note: -layout gives layout-preserved text, not line numbers, and it mangles tables / GANTT / budget — read those from the PDF pages directly. Build a section→line map in the reading notes.

Stage 1 — PREP (probe-driven draft)

Read the application against the analytical probes ([[review-probes]]) and produce two deliberately separated things ([[reading-notes-format]]):

a factual, line-mapped notes file (where-to-look + the application's claims, line-cited); and
a raw judgement block — your actual hunches and reservations, kept verbatim and never tidied, quarantined from submission text. This is where the real assessment forms; bless it as first-class, do not sanitise it into blandness.

Then draft evidence-cited per-criterion comments applying the probes, following the scheme's bands ([[review-guidance]]). Scores left blank.

PREP self-review before finishing (guard F3 — PREP is the weakest self-checking link):

No integrity-adjacent or third-party-identifying material in prose (route separately).
Recompute any number before asserting it — never quote a total you haven't re-derived.
Mark paraphrase as paraphrase. Never write a phrase a downstream reader could mistake for a quotable string. This is the exact seed of a fabricated quote. Do not dress a guess as a fact with a [?].

Stage 2 — VERIFY (adversarial, independent) — the highest-value stage

Run as a separate contextless pass (fan-out) or an explicit, forceful hat-switch: "You are now an adversarial verifier. The draft's authors are not to be trusted. Distrust any prior PASS. Demand proof." Full method in [[review-verification]]. The core:

Claim-by-claim: classify every claim CONFIRMED (line# + verbatim quote) / UNSUPPORTED (say where you looked) / WRONG (contradicting quote + line). Absence-claims require a documented grep with synonyms.
Verbatim-quote-existence sweep (the single highest-value check): every quoted string in the draft must appear verbatim in source. A null grep on a quoted phrase is a presumptive BLOCKER.
Tool-backed arithmetic: recompute every total from its components (python3 -c …, not mental math). Quote no total you haven't re-derived.
Read the PDF pages for GANTT / budget / timeline; run an internal-contradiction sweep across fields (FTE vs "retired"; dates vs funding window; refs vs in-text).
Independent gap analysis: read the application cold and find strengths and weaknesses the draft missed; check balance across criteria. (The highest-value catches come from here, not from checking the draft's own claim list.)
Verify your own corrections before asserting them.
Classify findings on the severity ladder BLOCKER / FIX / NIT ([[review-verification]]).

Stage 3 — VOICE & DE-TEMPLATE

Convert the verified draft into clean, signable prose in the academic's voice ([[voice-and-detemplating]]):

Two registers: prep is maximally specific; the final prose is generally-assertable from one careful reading without re-checking any number. Everything cut is preserved in git history, not a carried appendix.
Render the academic's stated position boldly. Raise register worries in conversation; never soften silently. The systematic failure mode here is under-assertion, not over-assertion.
Cut to roughly one reservation per criterion, framed constructively, anchored to the application's own promises; aim support/resourcing critiques at institutions, never the vulnerable researcher.
Voice onboarding: if the academic's voice file ({academic}-style.md) isn't trained yet, run the reflection-on-diff loop (agent drafts → human edits → agent diffs and codifies the delta) over the first few reviews. A static style guide alone is insufficient.
Whole-set de-template (only when >1 review in a round): census recurring pivot formulas across the set → budget ≤1 use per formula → two-tier rewrite (re-mechanise the load-bearing critique sentences; light synonym de-dup is fine only for low-stakes praise) → re-scan for new fingerprints the fix introduced → machine-check pin-cite counts unchanged.

Stage 4 — FINAL-CHECK & submit

Pre-"ready" gate (guard F7): verbatim-quote grep + spell + dash-normalise + budget-recompute must pass before a draft is called ready. The final check confirms; it must not be the first place anyone greps a quote.
Run a final contextless confirmation pass (the VERIFY techniques as a safety net).
Boundary gate (guard F1 — the most important structural rule): "verified" attaches to a committed artifact, not an idea. Any regeneration after VERIFY — a voice rewrite, a polish pass, a hand-edit — re-enters verification. Stamp the verified commit; re-run the checks on any later diff. On-disk drafts are mutable and get rewritten post-hoc (F8): trust the committed/verified artifact, never a live draft, as the record.
The academic owns scores, top-line read, net call, and submission. Submit; save the confirmation to the task file.

Side-stage — Integrate external / cross-model feedback (optional)

When external or cross-model review comments arrive, integrate them with a distrust default ([[external-feedback]]): per-claim adjudication, a two-axis filter (is it already covered, sharper? does its framing reverse a position we reached?), and a one-line justification per ADD / MODIFY / REJECT. Cross-model's proven value is distillation (of the academic's analytical signature) and corroboration of emphasis — not new content. A final cross-model truth-check is reasonable but optional, not the primary safety net.

Cross-cutting guards (F1–F8)

Guard	Rule
F1	Any regeneration re-enters verification; "verified" stamps a committed artifact. — [[review-verification]]
F2	VERIFY is always a full independent cold re-read + re-derivation — never a citation-checker. Includes the verbatim-quote-existence sweep and tool-backed arithmetic. — [[review-verification]]
F3	PREP self-reviews before writing (integrity material, un-recomputed numbers, paraphrase-as-paraphrase).
F4	Any integration/fix step re-reads the edited artifact and re-derives every number/quote it lands; uses the narrowest possible edit target. — [[review-verification]]
F5	Silo per-application for fact-checking; one whole-set pass for anything cross-document (fingerprints, conflicts, consistency).
F6	Severity ladder with explicit BLOCKER / FIX / NIT definitions ([[review-verification]]).
F7	Verification techniques live in a pre-"ready" gate, not only at the end. — [[review-verification]]
F8	On-disk drafts are mutable and get rewritten post-hoc — never treat a live draft as proof of a stage's output; the committed/verified artifact is the record. — [[review-verification]]

Scribe Mode (collegial drafts)

For reviewing draft papers / proposals written by colleagues before submission:

Assemble materials under ${ACA_DATA:-~/brain}/reviews/{author}/.
Generate reading notes ([[reading-notes-format]]) mapping the author's questions to line numbers, plus a raw judgement block.
Draft feedback in the academic's voice ([[voice-and-detemplating]]). The author is in the room — be direct but constructive; the academic provides the judgment, the agent supports.

References

[[review-probes]] — the analytical core: the probes PREP reads the application against
[[review-verification]] — the adversarial VERIFY stage: techniques, severity ladder, self-verify
[[voice-and-detemplating]] — two registers, voice onboarding, whole-set de-templating
[[external-feedback]] — integrating external / cross-model comments, distrust-default
[[reviewer-roles]] — role distinctions (Detailed vs General Assessor vs collegial vs journal)
[[review-guidance]] — taxonomy, scoring bands, score–text alignment, anti-bias drafting discipline
[[review-template]] — assessment file template
[[reading-notes-format]] — reading-notes layout (factual notes + raw judgement block)
[[platform-instructions]] — submission platform tips; integrity / foreign-affiliation routing
PKB (optional): search for {scheme}-Assessor-Handbook or {scheme} Grant Guidelines; if no PKB, fetch the handbook from the scheme's site each round.