Skip to main content
Manusで任意のスキルを実行
ワンクリックで

pxi-eval-dataset

// Generate synthetic evaluation datasets for the PXI eval harness (evals/pxi/). Use whenever the user asks to create, author, draft, expand, or audit an eval dataset for a PXI tool, skill, or behavior — including phrases like "write evals for <tool>", "test PXI behavior", "synthetic dataset for PXI", "cover this tool with eval examples", or "find gaps in our PXI eval coverage". Inspects whichever evaluators currently live under evals/pxi/evaluators/ at use time and pauses to recommend a new evaluator if the behavior under test can't be scored by what already exists.

$ git log --oneline --stat
stars:9,927
forks:905
updated:2026年5月22日 17:12
ファイルエクスプローラー
2 ファイル
SKILL.md
readonly