Skip to main content
Ejecuta cualquier Skill en Manus
con un clic
$pwd:

eval-run

// Execute skill evaluation against test cases, score with judges, and report results. Requires eval.yaml (generated by /eval-analyze). Use when the user wants to test a skill, run eval, benchmark, compare models, detect regressions, check skill quality, or verify changes didn't break anything. Triggers on "run eval", "test the skill", "evaluate", "benchmark", "check for regressions", "how does my skill perform", "score the skill", "run the tests", "run my evals", "compare against baseline", "did I break anything", "test my changes". Also called by /eval-optimize for automated iterations.

$ git log --oneline --stat
stars:15
forks:18
updated:29 de mayo de 2026, 17:16
Explorador de archivos
13 archivos
SKILL.md
readonly