Skip to main content
تشغيل أي مهارة في Manus
بنقرة واحدة

slime-user

Guide for using SLIME (LLM post-training framework for RL Scaling). Use when working with SLIME for reinforcement learning training of language models, including setup, configuration, training execution, multi-turn interactions, custom reward models, tool calling scenarios, or troubleshooting SLIME workflows. Covers GRPO, GSPO, PPO, Reinforce++, multi-agent RL, VLM training, FSDP/Megatron backends, SGLang integration, dynamic sampling, and custom generation functions.

نظرة عامة

Guide for using SLIME (LLM post-training framework for RL Scaling). Use when working with SLIME for reinforcement learning training of language models, including setup, configuration, training execution, multi-turn interactions, custom reward models, tool calling scenarios, or troubleshooting SLIME workflows. Covers GRPO, GSPO, PPO, Reinforce++, multi-agent RL, VLM training, FSDP/Megatron backends, SGLang integration, dynamic sampling, and custom generation functions.

أمر التثبيت
npx skills add https://github.com/yzlnew/infra-skills --skill slime-user

انسخ والصق هذا الأمر في Claude Code لتثبيت المهارة

النجوم١٢٨
التفرعات٩
آخر تحديث١٤ يناير ٢٠٢٦ في ٠٥:٤٦
مستكشف الملفات
4 ملفات
SKILL.md
readonly