Skip to main content
Jeden Skill in Manus ausführen
mit einem Klick
$pwd:

trl-training

// Train and fine-tune transformer language models using TRL (Transformers Reinforcement Learning). Supports SFT, DPO, GRPO, KTO, RLOO and Reward Model training via CLI commands.

$ git log --oneline --stat
stars:18.449
forks:2.736
updated:16. Februar 2026 um 16:02
SKILL.md
readonly