Skip to main content
Manusで任意のスキルを実行
ワンクリックで
$pwd:

eval-analyze

// Analyze a skill and generate eval.yaml for the agent eval harness. Deeply examines the skill's SKILL.md, sub-skills, scripts, and test cases to produce the full evaluation config — execution mode, dataset schema, output descriptions, judges, models, and thresholds. Use this skill whenever someone wants to set up evaluation, test a skill, add quality checks, benchmark a skill, or just created a new skill and needs eval infrastructure. Also triggered automatically by /eval-run when eval.yaml is missing. Even if the user just says "how do I know if my skill is working?" — this is the right starting point.

$ git log --oneline --stat
stars:15
forks:18
updated:2026年5月29日 13:49
ファイルエクスプローラー
8 ファイル
SKILL.md
readonly