kaggle-benchmarks

Write benchmark tasks to evaluate LLMs using the kaggle_benchmarks Python library. Covers task decorators, structured outputs, assertions, tools, dataset evaluation, and multi-turn conversations.

stars: 149
forks: 36
updated: May 15, 2026 at 16:22