local-rl-alignment-engineering

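Engineering practice for reinforcement-learning alignment of local base models, covering RLHF/DPO/GRPO algorithm selection, GPU memory optimization, framework selection, data engineering, and an end-to-end implementation guide.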

Install command
npx skills add https://github.com/hiyenwong/ai_collection --skill local-rl-alignment-engineering

Copy and paste this command into Claude Code to install the skill

Stars: 1
Forks: 0
Updated: June 4, 2026 at 02:00
File Explorer
2 files
SKILL.md