Skip to main content
Ejecuta cualquier Skill en Manus
con un clic
$pwd:

test-model-deepseekr1-mtp-tp

// DeepSeek-R1 MTP-TP test: LightLLM api_server with MTP (EAGLE) draft, tensor parallel only (--tp 8, no --dp, no EP MoE), plus GSM8K lm_eval on localhost. Distinct from the MTP-EP-TPDP skill which uses --tp 8 --dp 8 and EP MoE. Requires a dedicated log directory, summary.txt, tokenizer aligned with MODEL_DIR. Use for TP-only MTP gsm8k accuracy runs.

$ git log --oneline --stat
stars:4081
forks:332
updated:22 de mayo de 2026, 00:56
SKILL.md
readonly