$pwd:

train-with-environments

// Train models with verifiers environments using hosted RL or prime-rl. Use when asked to configure RL runs, tune key hyperparameters, diagnose instability, set up difficulty filtering, or create practical train and eval loops for new environments.

$ git log --oneline --stat
stars:4,143
forks:553
updated:2026-05-30 08:21
SKILL.md