Skip to main content
Jeden Skill in Manus ausführen
mit einem Klick
$pwd:

evaluate-environments

// Run and analyze evaluations for verifiers environments using prime eval. Use when asked to smoke-test environments, run benchmark sweeps, resume interrupted evaluations, compare models, inspect sample-level outputs, or produce evaluation summaries suitable for deciding next steps.

$ git log --oneline --stat
stars:4.143
forks:553
updated:29. Mai 2026 um 23:42
SKILL.md
readonly