parallelism-strategies

// Operational guide for choosing and combining parallelism strategies (TP/PP/DP/CP/SP/EP) for the SkyRL Megatron backend. Use when sizing parallelism for a new model, debugging OOM/throughput on a given cluster topology, or extending an existing recipe to a new GPU count. Includes model-size sizing rules, hardware topology mapping, sequence-length thresholds, MoE-specific patterns, and pitfalls.

$ git log --oneline --stat
stars: 1,912
forks: 339
updated: 17 May 2026 at 23:33
SKILL.md
readonly