Skip to main content
Manusで任意のスキルを実行
ワンクリックで

slime-user

Guide for using SLIME (LLM post-training framework for RL Scaling). Use when working with SLIME for reinforcement learning training of language models, including setup, configuration, training execution, multi-turn interactions, custom reward models, tool calling scenarios, or troubleshooting SLIME workflows. Covers GRPO, GSPO, PPO, Reinforce++, multi-agent RL, VLM training, FSDP/Megatron backends, SGLang integration, dynamic sampling, and custom generation functions.

概要

Guide for using SLIME (LLM post-training framework for RL Scaling). Use when working with SLIME for reinforcement learning training of language models, including setup, configuration, training execution, multi-turn interactions, custom reward models, tool calling scenarios, or troubleshooting SLIME workflows. Covers GRPO, GSPO, PPO, Reinforce++, multi-agent RL, VLM training, FSDP/Megatron backends, SGLang integration, dynamic sampling, and custom generation functions.

インストールコマンド
npx skills add https://github.com/yzlnew/infra-skills --skill slime-user

このコマンドをClaude Codeにコピー&ペーストしてスキルをインストール

スター128
フォーク9
更新日2026年1月14日 05:46
ファイルエクスプローラー
4 ファイル
SKILL.md
readonly