Skip to main content
Exécutez n'importe quel Skill dans Manus
en un clic
$pwd:

eval-run

// Execute skill evaluation against test cases, score with judges, and report results. Requires eval.yaml (generated by /eval-analyze). Use when the user wants to test a skill, run eval, benchmark, compare models, detect regressions, check skill quality, or verify changes didn't break anything. Triggers on "run eval", "test the skill", "evaluate", "benchmark", "check for regressions", "how does my skill perform", "score the skill", "run the tests", "run my evals", "compare against baseline", "did I break anything", "test my changes". Also called by /eval-optimize for automated iterations.

$ git log --oneline --stat
stars:15
forks:18
updated:29 mai 2026 à 17:16
Explorateur de fichiers
13 fichiers
SKILL.md
readonly