Skip to main content
Ejecuta cualquier Skill en Manus
con un clic
$pwd:

evaluate-environments

// Run and analyze evaluations for verifiers environments using prime eval. Use when asked to smoke-test environments, run benchmark sweeps, resume interrupted evaluations, compare models, inspect sample-level outputs, or produce evaluation summaries suitable for deciding next steps.

$ git log --oneline --stat
stars:4143
forks:553
updated:29 de mayo de 2026, 23:42
SKILL.md
readonly