$pwd:

evaluating-llms-harness

// lm-eval-harness: benchmark LLMs (MMLU, GSM8K, etc.).

$ git log --oneline --stat
stars:142,444
forks:22,207
updated:May 8, 2026 at 21:27
SKILL.md