running-eval-suite

Runs every reference benchmark under benchmarks/eval/ and refreshes the reference-table cells in benchmark_*.py. The host hardware (H200/H100/H800/...) is auto-detected. Where a row's hardware entry matches the host, that row's cells are replaced in place; for new hardware or a new workload tag, a new row is appended to the section's table. Local-Pipeline-Result tables are skipped. A single slash command runs everything end-to-end and auto-commits the result.
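The replace-or-append rule described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the skill's actual implementation: the row layout (`[hardware, workload, *cells]`) and the names `refresh_table` and `should_skip` are hypothetical, and the real table format inside benchmark_*.py may differ.

```python
from typing import Dict, List

# Hypothetical row layout: [hardware, workload_tag, metric cells...]
Row = List[str]


def should_skip(section_title: str) -> bool:
    """Local-Pipeline-Result tables are left untouched (title match is an assumption)."""
    return "Local-Pipeline-Result" in section_title


def refresh_table(rows: List[Row], hw: str, results: Dict[str, List[str]]) -> List[Row]:
    """Return a copy of `rows` refreshed for the detected hardware `hw`.

    `results` maps a workload tag to its freshly measured cells.
    Rows matching (hw, workload) are replaced in place; workloads with
    no matching row are appended as new rows. Other hardware is untouched.
    """
    updated = [list(r) for r in rows]  # copy so the input is not mutated
    seen = set()
    for row in updated:
        row_hw, workload = row[0], row[1]
        if row_hw == hw and workload in results:
            row[2:] = results[workload]  # replace the metric cells in place
            seen.add(workload)
    for workload, cells in results.items():
        if workload not in seen:
            updated.append([hw, workload, *cells])  # new hardware or new tag
    return updated
```

Rows for other hardware are never modified, so a run on one machine only touches its own entries.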

stars: 296
forks: 131
updated: May 22, 2026 at 16:19