Skip to main content
Exécutez n'importe quel Skill dans Manus
en un clic

eval-validity-review

// Review a single evaluation's validity — whether its claims hold up, whether its name is accurate, whether samples can be both succeeded and failed at, and whether scoring measures ground truth. Use when user asks to check validity of an eval, or as part of the Master Checklist workflow. Do NOT use for code quality or test coverage (use eval-quality-workflow or ensure-test-coverage instead).

$ git log --oneline --stat
stars:518
forks:336
updated:30 avril 2026 à 16:50
SKILL.md
readonly