Skip to main content
Run any Skill in Manus
with one click

rl-policy-optimization

Stars13,557
Forks1,589
UpdatedMarch 23, 2026 at 01:46

Best practices for reinforcement learning policy optimization. Use when working on RL agents, PPO, SAC, or reward design.

Installation

Install with Codex or Claude Copy this prompt, paste it into Codex, Claude, or another assistant, and let it review the skill page and install it for you.

SKILL.md
readonly