Skip to main content
Exécutez n'importe quel Skill dans Manus
en un clic

benchmark-audit

// Audit benchmark suites against ABC framework (Task/Outcome/Reporting validity). Checks instruction quality, verifier correctness, reproducibility. Triggers on benchmark audit, audit benchmark, abc audit, task validity.

$ git log --oneline --stat
stars:25
forks:3
updated:17 mars 2026 à 01:39
SKILL.md
readonly