Skip to main content
Run any Skill in Manus
with one click
$pwd:

10x-eval-model

// Set up and run benchmark evaluations for new LLM models in the 10xBench project. Use when the user wants to add a new model to the benchmark, prepare evaluation directories, update metadata, or launch evaluation runs. Triggers on phrases like "eval model", "add model to benchmark", "run benchmark for [model]", "evaluate [model-name]", "set up [model] for eval", or any request involving adding a new model to the Przeprogramowani.pl benchmark pipeline.

$ git log --oneline --stat
stars:5
forks:4
updated:May 6, 2026 at 07:12
SKILL.md
readonly