Skip to main content
Ejecuta cualquier Skill en Manus
con un clic
$pwd:

parallelism-strategies

// Operational guide for choosing and combining parallelism strategies (TP/PP/DP/CP/SP/EP) for the SkyRL Megatron backend. Use when sizing parallelism for a new model, debugging OOM/throughput on a given cluster topology, or extending an existing recipe to a new GPU count. Includes model-size sizing rules, hardware topology mapping, sequence-length thresholds, MoE-specific patterns, and pitfalls.

$ git log --oneline --stat
stars:1912
forks:339
updated:17 de mayo de 2026, 23:33
SKILL.md
readonly