$pwd:

trl-training

// Train and fine-tune transformer language models using TRL (Transformer Reinforcement Learning). Supports SFT, DPO, GRPO, KTO, RLOO, and Reward Model training via CLI commands.

$ git log --oneline --stat
stars:18,449
forks:2,736
updated:2026-02-16 16:02
SKILL.md