#001SkyRL1 个 skills1.9k339更新于 2026-05-17占该创作者 100%skill职业分类描述更新parallelism-strategies数据科学家Operational guide for choosing and combining parallelism strategies (TP/PP/DP/CP/SP/EP) for the SkyRL Megatron backend. Use when sizing parallelism for a new model, debugging OOM/throughput on a given cluster topology, or extending an existing recipe to a new GPU count. Includes model-size sizing rules, hardware topology mapping, sequence-length thresholds, MoE-specific patterns, and pitfalls.2026-05-17