
vllm-sota-humanize-loop

// Run an autonomous, Humanize-governed vLLM SOTA performance loop for a single LLM. First, perform the fixed, fair deployment search and benchmark across vLLM, SGLang, and TensorRT-LLM. Then start one RLCR loop that repeatedly determines the remaining gap, profiles the current bottleneck, runs layer/kernel pipeline analysis, patches vLLM code, optionally uses ncu-report-skill for kernel evidence, and revalidates, until vLLM matches or beats the best observed framework under the same workload and SLA.
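
The control flow described above can be sketched roughly as follows. This is only an illustrative outline, not the skill's actual implementation: `BenchResult`, `run_loop`, `measure_vllm`, `improve_once`, and `max_rounds` are hypothetical names introduced here, and the single `throughput` number stands in for whatever metric the shared workload and SLA actually define.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class BenchResult:
    """One benchmark outcome (hypothetical shape, for illustration only)."""
    framework: str
    throughput: float  # stand-in metric; higher is better
    meets_sla: bool


def run_loop(
    baselines: List[BenchResult],
    measure_vllm: Callable[[], BenchResult],
    improve_once: Callable[[BenchResult, BenchResult], None],
    max_rounds: int = 10,
) -> Tuple[BenchResult, int]:
    """Iterate until vLLM matches or beats the best SLA-compliant baseline.

    Returns the last vLLM measurement and the number of rounds used.
    """
    # Phase 1 output: pick the best observed framework that satisfies the SLA.
    eligible = [b for b in baselines if b.meets_sla]
    if not eligible:
        raise ValueError("no baseline meets the SLA")
    target = max(eligible, key=lambda b: b.throughput)

    current = measure_vllm()
    rounds = 0
    # Phase 2: determine the gap, improve, revalidate, repeat.
    while rounds < max_rounds and not (
        current.meets_sla and current.throughput >= target.throughput
    ):
        # Hypothetical hook covering profiling, pipeline analysis, and patching.
        improve_once(current, target)
        # Revalidate under the same workload and SLA.
        current = measure_vllm()
        rounds += 1
    return current, rounds
```

The `max_rounds` cap is an assumption added so the sketch always terminates; the description itself only states the exit condition of matching or beating the best observed framework.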

stars: 483
forks: 41
updated: May 26, 2026 at 13:03