Skip to main content
Manus에서 모든 스킬 실행
원클릭으로
$pwd:

trl-training

// Train and fine-tune transformer language models using TRL (Transformers Reinforcement Learning). Supports SFT, DPO, GRPO, KTO, RLOO and Reward Model training via CLI commands.

$ git log --oneline --stat
stars:18,449
forks:2,736
updated:2026년 2월 16일 16:02
SKILL.md
readonly