Skip to main content
Execute qualquer Skill no Manus
com um clique
$pwd:

parallelism-strategies

// Operational guide for choosing and combining parallelism strategies (TP/PP/DP/CP/SP/EP) for the SkyRL Megatron backend. Use when sizing parallelism for a new model, debugging OOM/throughput on a given cluster topology, or extending an existing recipe to a new GPU count. Includes model-size sizing rules, hardware topology mapping, sequence-length thresholds, MoE-specific patterns, and pitfalls.

$ git log --oneline --stat
stars:1.912
forks:339
updated:17 de maio de 2026 às 23:33
SKILL.md
readonly