model-based-diffusion-policy-optimization

Overview

Model-Based Diffusion Policy Optimization (MBDPO) is a methodology for scaling world-model reinforcement learning. It unifies search and policy optimization through diffusion policy representations in order to address structural misalignment. Use it for world-model RL, diffusion-based policy learning, offline pretraining, and model-based RL scaling.

Install command
npx skills add https://github.com/hiyenwong/ai_collection --skill model-based-diffusion-policy-optimization

Copy and paste this command into Claude Code to install the skill.

Stars: 1
Forks: 0
Updated: June 4, 2026 at 02:00