Skip to main content
تشغيل أي مهارة في Manus
بنقرة واحدة

megatron-memory-estimator

Estimate GPU memory usage for Megatron-based MoE (Mixture of Experts) and dense models. Use when users need to (1) estimate memory from HuggingFace model configs (DeepSeek-V3, Qwen, etc.), (2) plan GPU resource allocation for training, (3) compare different parallelism strategies (TP/PP/EP/CP), (4) determine if a model fits in available GPU memory, or (5) optimize training configurations for memory efficiency.

نظرة عامة

Estimate GPU memory usage for Megatron-based MoE (Mixture of Experts) and dense models. Use when users need to (1) estimate memory from HuggingFace model configs (DeepSeek-V3, Qwen, etc.), (2) plan GPU resource allocation for training, (3) compare different parallelism strategies (TP/PP/EP/CP), (4) determine if a model fits in available GPU memory, or (5) optimize training configurations for memory efficiency.

أمر التثبيت
npx skills add https://github.com/yzlnew/infra-skills --skill megatron-memory-estimator

انسخ والصق هذا الأمر في Claude Code لتثبيت المهارة

النجوم١٢٨
التفرعات٩
آخر تحديث١٠ يناير ٢٠٢٦ في ٠٢:٠٤
مستكشف الملفات
13 ملفات
SKILL.md
readonly