$pwd:

parallelism-strategies

// Operational guide for choosing and combining parallelism strategies (TP/PP/DP/CP/SP/EP) for the SkyRL Megatron backend. Use when sizing parallelism for a new model, debugging OOM/throughput on a given cluster topology, or extending an existing recipe to a new GPU count. Includes model-size sizing rules, hardware topology mapping, sequence-length thresholds, MoE-specific patterns, and pitfalls.

$ git log --oneline --stat
stars: 1,912
forks: 339
updated: May 17, 2026 at 23:33
SKILL.md