Ejecuta cualquier Skill en Manus
con un clic

Ejecuta cualquier Skill en Manus con un clic

research-pipeline-runner

Estrellas472

Forks36

Actualizado17 de abril de 2026, 07:16

Run this repo’s Units+Checkpoints research pipelines end-to-end (survey/brief/paper-review/evidence-review/idea/tutorial/graduate-paper), with workspaces + checkpoints. **Trigger**: run pipeline, kickoff, 继续执行, 自动跑, 写一篇, survey/brief/review/调研/教程/系统综述/审稿. **Use when**: 用户希望端到端跑流程（创建 `workspaces/<name>/`、生成/执行 `UNITS.csv`、遇到 HUMAN checkpoint 停下等待）。 **Skip if**: 用户明确要手工逐条执行（用 `unit-executor`），或你不应自动推进到 prose 阶段。 **Network**: depends on selected pipeline (arXiv/PDF/citation verification may need network; offline import supported where available). **Guardrail**: 必须尊重 checkpoints（无 Approve 不写 prose）；遇到 HUMAN 单元必须停下等待；禁止在 repo root 创建 workspace 工件。

Instalación

Instalar con Codex o Claude Copia este prompt, pégalo en Codex, Claude u otro asistente, y deja que revise la página de la skill y la instale por ti.

Ejecutar en Manus

Fuente

WILLOSCAR

WILLOSCAR/research-units-pipeline-skills

Abrir repositorio de GitHub Ver repositorios del creador

Descarga

Ejecutar en Manus

Ocupaciones relacionadasSOC

Basado en la clasificación ocupacional SOC

Desarrolladores de softwareOcupaciones informáticas y matemáticas·SOC 15-1252

SKILL.md

readonly

name

research-pipeline-runner

description

Research Pipeline Runner

Goal: let a user trigger a full pipeline with one natural-language request, while keeping the run auditable (Units + artifacts + checkpoints).

This skill is coordination:

semantic work is done by the relevant skills’ SKILL.md
scripts are deterministic helpers (scaffold/validate/compile), not the author

Inputs

User goal (one sentence is enough), e.g.:
- “给我写一个 agent 的 arxiv-survey-latex”
Optional:
- explicit pipeline path (e.g., pipelines/arxiv-survey-latex.pipeline.md)
- constraints (time window, language: EN/中文, evidence_mode: abstract/fulltext)

Outputs

A workspace under workspaces/<name>/ containing:
- STATUS.md, GOAL.md, PIPELINE.lock.md, UNITS.csv, CHECKPOINTS.md, DECISIONS.md
- pipeline-specific artifacts (papers/outline/sections/output/latex)

Non-negotiables

Use UNITS.csv as the execution contract; one unit at a time.
Respect checkpoints (CHECKPOINTS.md): no long prose until required approvals are recorded in DECISIONS.md (survey default: C2).
Stop at HUMAN checkpoints and wait for explicit sign-off.
Never create workspace artifacts in the repo root; always use workspaces/<name>/.

Decision tree: pick a pipeline

User goal → choose:

Survey/综述/调研 + Markdown draft → pipelines/arxiv-survey.pipeline.md
Survey/综述/调研 + PDF output → pipelines/arxiv-survey-latex.pipeline.md
Research brief / rapid review / 速览 → pipelines/research-brief.pipeline.md
Paper review / paper critique / 审稿 → pipelines/paper-review.pipeline.md
Evidence review / systematic review / 系统综述 → pipelines/evidence-review.pipeline.md
Idea finding / 选题 / 点子 / 找方向 → pipelines/idea-brainstorm.pipeline.md
Tutorial/教程 → pipelines/source-tutorial.pipeline.md

Recommended run loop (skills-first)

Initialize workspace (C0):

create workspaces/<name>/
write GOAL.md, lock pipeline (PIPELINE.lock.md), seed queries.md

Execute units sequentially:

follow each unit’s SKILL.md to produce the declared outputs
only mark DONE when acceptance criteria are satisfied and outputs exist

Stop at HUMAN checkpoints:

default survey checkpoint is C2 (scope + outline)
write a concise approval request in DECISIONS.md and wait

Writing-stage self-loop (when drafts look thin/template-y):

prefer local fixes over rewriting everything:
- writer-context-pack (C4→C5 bridge) makes packs debuggable
- subsection-writer writes per-file units
- writer-selfloop fixes only failing sections/*.md
- paragraph-curator / style-harmonizer / opener-variator converge structure and de-template the prose
- evaluation-anchor-checker is the late section-level numeric hygiene sweep before merge
- draft-polisher removes generator voice without changing citation keys

Strict-mode behavior (by design)

In --strict runs, several semantic C3/C4 artifacts are treated as scaffolds until explicitly marked refined. This is intentional: it prevents bootstrap JSONL from silently passing into C5 writing (a major source of hollow/templated prose).

Create these markers only after you have manually refined/spot-checked the artifacts:

outline/subsection_briefs.refined.ok
outline/chapter_briefs.refined.ok
outline/evidence_bindings.refined.ok
outline/evidence_drafts.refined.ok
outline/anchor_sheet.refined.ok
outline/writer_context_packs.refined.ok

The runner may BLOCK even if the JSONL exists; add the marker after refinement, then rerun/resume the unit.

Finish:

merge → audit → (optional) LaTeX scaffold/compile

Optional CLI helpers (debug only)

Kickoff + run (optional; convenient, not required): python scripts/pipeline.py kickoff --topic "<topic>" --pipeline <pipeline-name> --run --strict
Resume: python scripts/pipeline.py run --workspace <ws> --strict
Approve checkpoint: python scripts/pipeline.py approve --workspace <ws> --checkpoint C2
Mark refined unit: python scripts/pipeline.py mark --workspace <ws> --unit-id <U###> --status DONE --note "LLM refined"

Handling common blocks

HUMAN approval required: summarize produced artifacts, ask for approval, then record it and resume.
Quality gate blocked (output/QUALITY_GATE.md exists): treat current outputs as scaffolding; refine per the unit’s SKILL.md; mark DONE; resume.
No network: use offline imports (papers/imports/ or arxiv-search --input).
Weak coverage: broaden queries or reduce/merge subsections (outline-budgeter) before writing.

Quality checklist

UNITS.csv statuses reflect actual outputs (no DONE without outputs).
No prose is written unless DECISIONS.md explicitly approves it.
The run stops at HUMAN checkpoints with clear next questions.
In strict mode, scaffold/stub outputs do not get marked DONE without refinement.

Más de este repositorio

mismo repositorio

agent-survey-corpus

WILLOSCAR/research-units-pipeline-skills

Download a small corpus of open-access arXiv survey/review PDFs about agentic systems and extract text for style learning. **Trigger**: agent survey corpus, ref corpus, download surveys, 学习综述写法, 下载 survey. **Use when**: you want to study how real agent surveys structure sections (6–8 H2), size subsections, and write evidence-backed comparisons. **Skip if**: you cannot download PDFs (no network) or you don't want local PDF files. **Network**: required. **Guardrail**: only download arXiv PDFs; store under `ref/` and keep large files out of git.

2026-05-30472

global-reviewer

WILLOSCAR/research-units-pipeline-skills

Global consistency review for survey drafts: terminology, cross-section coherence, and scope/citation hygiene. Writes `output/GLOBAL_REVIEW.md` and (optionally) applies safe edits to `output/DRAFT.md`. **Trigger**: global review, consistency check, coherence audit, 术语一致性, 全局回看, 章节呼应, 拷打 writer. **Use when**: Draft exists and you want a final evidence-first coherence pass before LaTeX/PDF. **Skip if**: You are still changing the outline/mapping/notes (do those first), or prose writing is not approved. **Network**: none. **Guardrail**: Do not invent facts or citations; do not add new citation keys; treat missing evidence as a failure signal.

2026-05-30472

literature-engineer

WILLOSCAR/research-units-pipeline-skills

Multi-route literature expansion + metadata normalization for evidence-first surveys. Produces a large candidate pool (`papers/papers_raw.jsonl`, target ≥1200) with stable IDs and provenance, ready for dedupe/rank + citation generation. **Trigger**: evidence collector, literature engineer, 文献扩充, 多路召回, snowballing, cited by, references, 元信息增强, provenance. **Use when**: 需要把候选文献扩充到 ≥1200 篇并补齐可追溯 meta（survey pipeline 的 Stage C1，写作前置 evidence）。 **Skip if**: 已经有高质量 `papers/papers_raw.jsonl`（≥1200 且每条都有稳定标识+来源记录）。 **Network**: 可离线（靠 imports）；雪崩/在线检索需要网络。 **Guardrail**: 不允许编造论文；每条记录必须带稳定标识（arXiv id / DOI / 可信 URL）和 provenance；不写 output/ prose。

2026-05-30472

pdf-text-extractor

WILLOSCAR/research-units-pipeline-skills

Download PDFs (when available) and extract plain text to support full-text evidence, writing `papers/fulltext_index.jsonl` and `papers/fulltext/*.txt`. **Trigger**: PDF download, fulltext, extract text, papers/pdfs, 全文抽取, 下载PDF. **Use when**: `queries.md` 设置 `evidence_mode: fulltext`（或你明确需要全文证据）并希望为 paper notes/claims 提供更强 evidence。 **Skip if**: `evidence_mode: abstract`（默认）；或你不希望进行下载/抽取（成本/权限/时间）。 **Network**: fulltext 下载通常需要网络（除非你手工提供 PDF 缓存在 `papers/pdfs/`）。 **Guardrail**: 缓存下载到 `papers/pdfs/`；默认不覆盖已有抽取文本（除非显式要求重抽）。

2026-05-30472

prose-writer

WILLOSCAR/research-units-pipeline-skills

Write `output/DRAFT.md` (or `output/SNAPSHOT.md`) from an approved outline and evidence packs, using only verified citation keys from `citations/ref.bib`. **Trigger**: write draft, prose writer, snapshot, survey writing, 写综述, 生成草稿, section-by-section drafting. **Use when**: structure is approved (`DECISIONS.md` has `Approve C2`) and evidence packs exist (`outline/subsection_briefs.jsonl`, `outline/evidence_drafts.jsonl`). **Skip if**: approvals are missing, or evidence packs are incomplete / scaffolded (missing-fields, TODO markers). **Network**: none. **Guardrail**: do not invent facts or citations; only cite keys present in `citations/ref.bib`; avoid pipeline-jargon leakage in final prose.

2026-05-30472

schema-normalizer

WILLOSCAR/research-units-pipeline-skills

Normalize cross-skill JSONL interfaces (ids + titles + citation key formats) so downstream skills do not rely on best-effort joins. **Trigger**: schema normalize, jsonl contract, interface drift, join drift, 字段不一致, schema 规范化. **Use when**: you have generated C2-C4 JSONL artifacts (outline/briefs/bindings/packs/anchors) and want deterministic, stable fields before self-loops/writing. **Skip if**: you are not using the survey pipelines, or the workspace already has a fresh PASS `output/SCHEMA_NORMALIZATION_REPORT.md` for the current artifacts. **Network**: none. **Guardrail**: NO PROSE; deterministic transforms only; do not invent evidence/claims; only fill missing ids/titles from `outline/outline.yml`.

2026-05-30472

name

research-pipeline-runner

description

Research Pipeline Runner

Goal: let a user trigger a full pipeline with one natural-language request, while keeping the run auditable (Units + artifacts + checkpoints).

This skill is coordination:

semantic work is done by the relevant skills’ SKILL.md
scripts are deterministic helpers (scaffold/validate/compile), not the author

Inputs

User goal (one sentence is enough), e.g.:
- “给我写一个 agent 的 arxiv-survey-latex”
Optional:
- explicit pipeline path (e.g., pipelines/arxiv-survey-latex.pipeline.md)
- constraints (time window, language: EN/中文, evidence_mode: abstract/fulltext)

Outputs

A workspace under workspaces/<name>/ containing:
- STATUS.md, GOAL.md, PIPELINE.lock.md, UNITS.csv, CHECKPOINTS.md, DECISIONS.md
- pipeline-specific artifacts (papers/outline/sections/output/latex)

Non-negotiables

Use UNITS.csv as the execution contract; one unit at a time.
Respect checkpoints (CHECKPOINTS.md): no long prose until required approvals are recorded in DECISIONS.md (survey default: C2).
Stop at HUMAN checkpoints and wait for explicit sign-off.
Never create workspace artifacts in the repo root; always use workspaces/<name>/.

Decision tree: pick a pipeline

User goal → choose:

Survey/综述/调研 + Markdown draft → pipelines/arxiv-survey.pipeline.md
Survey/综述/调研 + PDF output → pipelines/arxiv-survey-latex.pipeline.md
Research brief / rapid review / 速览 → pipelines/research-brief.pipeline.md
Paper review / paper critique / 审稿 → pipelines/paper-review.pipeline.md
Evidence review / systematic review / 系统综述 → pipelines/evidence-review.pipeline.md
Idea finding / 选题 / 点子 / 找方向 → pipelines/idea-brainstorm.pipeline.md
Tutorial/教程 → pipelines/source-tutorial.pipeline.md

Recommended run loop (skills-first)

Initialize workspace (C0):

create workspaces/<name>/
write GOAL.md, lock pipeline (PIPELINE.lock.md), seed queries.md

Execute units sequentially:

follow each unit’s SKILL.md to produce the declared outputs
only mark DONE when acceptance criteria are satisfied and outputs exist

Stop at HUMAN checkpoints:

default survey checkpoint is C2 (scope + outline)
write a concise approval request in DECISIONS.md and wait

Writing-stage self-loop (when drafts look thin/template-y):

prefer local fixes over rewriting everything:
- writer-context-pack (C4→C5 bridge) makes packs debuggable
- subsection-writer writes per-file units
- writer-selfloop fixes only failing sections/*.md
- paragraph-curator / style-harmonizer / opener-variator converge structure and de-template the prose
- evaluation-anchor-checker is the late section-level numeric hygiene sweep before merge
- draft-polisher removes generator voice without changing citation keys

Strict-mode behavior (by design)

Create these markers only after you have manually refined/spot-checked the artifacts:

outline/subsection_briefs.refined.ok
outline/chapter_briefs.refined.ok
outline/evidence_bindings.refined.ok
outline/evidence_drafts.refined.ok
outline/anchor_sheet.refined.ok
outline/writer_context_packs.refined.ok

The runner may BLOCK even if the JSONL exists; add the marker after refinement, then rerun/resume the unit.

Finish:

merge → audit → (optional) LaTeX scaffold/compile

Optional CLI helpers (debug only)

Kickoff + run (optional; convenient, not required): python scripts/pipeline.py kickoff --topic "<topic>" --pipeline <pipeline-name> --run --strict
Resume: python scripts/pipeline.py run --workspace <ws> --strict
Approve checkpoint: python scripts/pipeline.py approve --workspace <ws> --checkpoint C2
Mark refined unit: python scripts/pipeline.py mark --workspace <ws> --unit-id <U###> --status DONE --note "LLM refined"

Handling common blocks

HUMAN approval required: summarize produced artifacts, ask for approval, then record it and resume.
Quality gate blocked (output/QUALITY_GATE.md exists): treat current outputs as scaffolding; refine per the unit’s SKILL.md; mark DONE; resume.
No network: use offline imports (papers/imports/ or arxiv-search --input).
Weak coverage: broaden queries or reduce/merge subsections (outline-budgeter) before writing.

Quality checklist

UNITS.csv statuses reflect actual outputs (no DONE without outputs).
No prose is written unless DECISIONS.md explicitly approves it.
The run stops at HUMAN checkpoints with clear next questions.
In strict mode, scaffold/stub outputs do not get marked DONE without refinement.