Skip to main content
在 Manus 中运行任何 Skill
一键导入

skill-forge-eval

// Run evaluation pipelines on Claude Code skills to test triggering accuracy, workflow correctness, and output quality. Spawns executor, grader, comparator, and analyzer sub-agents for parallel evaluation. Generates eval_metadata.json, grading.json, and feedback reports. Use when user says "eval skill", "test skill", "run evals", "evaluate skill", "skill evals", "test skill quality", "run skill tests", or "skill evaluation".

$ git log --oneline --stat
stars:58
forks:28
updated:2026年3月6日 16:30
SKILL.md
readonly