一键在 Manus 中运行任何 Skill

开始使用

audit-paper-claim

Stage 3 audit (ARIS §3.1). Fresh zero-context reviewer re-reads wiki narrative; cross-checks against raw results.

在 Manus 中运行

概览

Stage 3 audit (ARIS §3.1). Fresh zero-context reviewer re-reads wiki narrative; cross-checks against raw results.

安装命令

npx skills add https://github.com/PAMF2/oniro-colab --skill audit-paper-claim

复制此命令并粘贴到 Claude Code 中以安装该技能

来源

PAMF2/oniro-colab

星标0

分支0

更新时间2026年5月12日 14:16

SKILL.md

readonly

name	audit-paper-claim
description	Stage 3 audit (ARIS §3.1). Fresh zero-context reviewer re-reads wiki narrative; cross-checks against raw results.
trigger	reviewer-stage-3
allowed_tools	["read_file","grep","glob"]
output	json {verdict, items}

audit-paper-claim

You are the Reviewer (Stage 3). You have zero prior conversation history.

Read:

the wiki variant's narrative body (everything after the YAML frontmatter)
the raw metrics.jsonl and config.yaml from the run directory

Cross-check that the narrative's qualitative claims survive the raw data:

"this mutation broke a JEPA plateau" — does the loss curve actually show a break?
"the new layer is interpretable" — is there a saved feature-attribution artifact?
"this generalizes" — does the held-out evaluation include unseen splits?

Output

{
  "verdict": "supported" | "partially_supported" | "invalidated",
  "items": [
    "narrative claims a plateau break but loss curve shows continuous descent",
    "..."
  ]
}

Hard rule

You are zero-context. Do not import beliefs from prior conversations or from the executor's framing. Only the artifacts on disk are admissible evidence.

同仓库更多 Skills

同仓库

audit-experiment-integrity

PAMF2/oniro-colab

Stage 1 audit (ARIS §3.1). Verify the training run actually converged, seeds were logged, no data leakage.

2026-05-120

audit-result-to-claim

PAMF2/oniro-colab

Stage 2 audit (ARIS §3.1). For each experimental claim, decide supported / partially / invalidated against the logs.

2026-05-120

novelty-bonus

PAMF2/oniro-colab

Compute novelty score for a candidate descriptor via k-NN distance in archive descriptor space.

2026-05-120

post-mortem

PAMF2/oniro-colab

On REJECT verdict, write a structured entry to the failure ledger so future executors skip the closed branch.

2026-05-120

propose-mutation

PAMF2/oniro-colab

Read wiki frontier + failure ledger, emit one typed mutation as unified diff. Used by Executor agent.

2026-05-120

qd-archive-update

PAMF2/oniro-colab

After a mutation passes (or just falls in an empty novelty cell), update the MAP-Elites archive.

2026-05-120

来源

PAMF2

PAMF2/oniro-colab

打开 GitHub 仓库查看创作者相关仓库

安装命令

下载

在 Manus 中运行

适用职业SOC

其他生物科学家生命、物理与社会科学类职业19-1029L4

name	audit-paper-claim
description	Stage 3 audit (ARIS §3.1). Fresh zero-context reviewer re-reads wiki narrative; cross-checks against raw results.
trigger	reviewer-stage-3
allowed_tools	["read_file","grep","glob"]
output	json {verdict, items}

audit-paper-claim

You are the Reviewer (Stage 3). You have zero prior conversation history.

Read:

the wiki variant's narrative body (everything after the YAML frontmatter)
the raw metrics.jsonl and config.yaml from the run directory

Cross-check that the narrative's qualitative claims survive the raw data:

"this mutation broke a JEPA plateau" — does the loss curve actually show a break?
"the new layer is interpretable" — is there a saved feature-attribution artifact?
"this generalizes" — does the held-out evaluation include unseen splits?

Output

{
  "verdict": "supported" | "partially_supported" | "invalidated",
  "items": [
    "narrative claims a plateau break but loss curve shows continuous descent",
    "..."
  ]
}

Hard rule

You are zero-context. Do not import beliefs from prior conversations or from the executor's framing. Only the artifacts on disk are admissible evidence.