Skip to main content
在 Manus 中运行任何 Skill
一键导入
$pwd:

evaluate-environments

// Run and analyze evaluations for verifiers environments using prime eval. Use when asked to smoke-test environments, run benchmark sweeps, resume interrupted evaluations, compare models, inspect sample-level outputs, or produce evaluation summaries suitable for deciding next steps.

$ git log --oneline --stat
stars:4,143
forks:553
updated:2026年5月29日 23:42
SKILL.md
readonly