Skip to main content
تشغيل أي مهارة في Manus
بنقرة واحدة
$pwd:

train-with-environments

// Train models with verifiers environments using hosted RL or prime-rl. Use when asked to configure RL runs, tune key hyperparameters, diagnose instability, set up difficulty filtering, or create practical train and eval loops for new environments.

$ git log --oneline --stat
stars:٤٬١٤٣
forks:٥٥٣
updated:٣٠ مايو ٢٠٢٦ في ٠٨:٢١
SKILL.md
readonly