Skip to main content
Ejecuta cualquier Skill en Manus
con un clic

slime-user

Guide for using SLIME (LLM post-training framework for RL Scaling). Use when working with SLIME for reinforcement learning training of language models, including setup, configuration, training execution, multi-turn interactions, custom reward models, tool calling scenarios, or troubleshooting SLIME workflows. Covers GRPO, GSPO, PPO, Reinforce++, multi-agent RL, VLM training, FSDP/Megatron backends, SGLang integration, dynamic sampling, and custom generation functions.

Resumen

Guide for using SLIME (LLM post-training framework for RL Scaling). Use when working with SLIME for reinforcement learning training of language models, including setup, configuration, training execution, multi-turn interactions, custom reward models, tool calling scenarios, or troubleshooting SLIME workflows. Covers GRPO, GSPO, PPO, Reinforce++, multi-agent RL, VLM training, FSDP/Megatron backends, SGLang integration, dynamic sampling, and custom generation functions.

Comando de instalación
npx skills add https://github.com/yzlnew/infra-skills --skill slime-user

Copia y pega este comando en Claude Code para instalar la habilidad

Estrellas128
Forks9
Actualizado14 de enero de 2026, 05:46
Explorador de archivos
4 archivos
SKILL.md
readonly