rl-training-diagnoser

Use when analyzing RL training status from W&B or locally logged metrics, especially actor KL loss, PPO KL, grad norm, clipfrac, critic score/reward/advantages, response length, global sequence length, entropy, and invalid draft rates, and when diagnosing failures by checking the run config and the Search-R1/verl implementation.

Stars: 1
Forks: 0
Updated: April 18, 2026 at 07:31
SKILL.md