Skip to main content
Manus에서 모든 스킬 실행
원클릭으로
$pwd:

vllm-sota-humanize-loop

// Run an autonomous Humanize-governed vLLM SOTA performance loop for one LLM model: first perform the fixed fair vLLM/SGLang/TensorRT-LLM deployment search and benchmark, then start one RLCR loop that repeatedly decides the gap, profiles the current bottleneck, runs layer/kernel pipeline analysis, patches vLLM code, optionally uses ncu-report-skill for kernel evidence, and revalidates until vLLM matches or beats the best observed framework under the same workload and SLA.

$ git log --oneline --stat
stars:483
forks:41
updated:2026년 5월 26일 13:03
파일 탐색기
2 개 파일
SKILL.md
readonly