Skip to main content
Run any Skill in Manus
with one click

nemo-mbridge-perf-moe-long-context

Long-context MoE training guidance for Megatron Bridge. Covers CP sizing, selective recompute, dispatcher choices, and practical patterns from DSV3, Qwen3, and Qwen3-Next long-context experiments.

Stars708
Forks355
UpdatedJune 2, 2026 at 19:48
File Explorer
6 files
SKILL.md
readonly