Skip to main content
تشغيل أي مهارة في Manus
بنقرة واحدة
$pwd:

vllm-sota-humanize-loop

// Run an autonomous Humanize-governed vLLM SOTA performance loop for one LLM model: first perform the fixed fair vLLM/SGLang/TensorRT-LLM deployment search and benchmark, then start one RLCR loop that repeatedly decides the gap, profiles the current bottleneck, runs layer/kernel pipeline analysis, patches vLLM code, optionally uses ncu-report-skill for kernel evidence, and revalidates until vLLM matches or beats the best observed framework under the same workload and SLA.

$ git log --oneline --stat
stars:٤٨٣
forks:٤١
updated:٢٦ مايو ٢٠٢٦ في ١٣:٠٣
مستكشف الملفات
2 ملفات
SKILL.md
readonly