Skip to main content
Manusで任意のスキルを実行
ワンクリックで

eval-harness

// Use when you need to evaluate an LLM pipeline or AI feature systematically — sets up an eval harness with test cases, scoring rubrics, and pass/fail tracking rather than one-off manual spot-checks

$ git log --oneline --stat
stars:34
forks:10
updated:2026年5月29日 07:11
SKILL.md
readonly