Skip to main content
Execute qualquer Skill no Manus
com um clique
$pwd:

vllm-sota-humanize-loop

// Run an autonomous Humanize-governed vLLM SOTA performance loop for one LLM model: first perform the fixed fair vLLM/SGLang/TensorRT-LLM deployment search and benchmark, then start one RLCR loop that repeatedly decides the gap, profiles the current bottleneck, runs layer/kernel pipeline analysis, patches vLLM code, optionally uses ncu-report-skill for kernel evidence, and revalidates until vLLM matches or beats the best observed framework under the same workload and SLA.

$ git log --oneline --stat
stars:483
forks:41
updated:26 de maio de 2026 às 13:03
Explorador de arquivos
2 arquivos
SKILL.md
readonly