Skip to main content
تشغيل أي مهارة في Manus
بنقرة واحدة

evaluation

// Evaluates accuracy of quantized or unquantized LLMs using NeMo Evaluator Launcher (NEL). Triggers on "evaluate model", "benchmark accuracy", "run MMLU", "evaluate quantized model", "run nel". Handles deployment, config generation, and evaluation execution. Not for quantizing models (use ptq), deploying/serving models (use deployment), or comparing completed baseline-vs-quantized results (use compare-results).

$ git log --oneline --stat
stars:٢٬٧٤٩
forks:٤٠٥
updated:٢٢ مايو ٢٠٢٦ في ١٨:٠٠
مستكشف الملفات
18 ملفات
SKILL.md
readonly