Skip to main content
Execute qualquer Skill no Manus
com um clique
$pwd:

llm-serving-capacity-planner

// Parse SGLang/vLLM startup logs to explain GPU memory use and request capacity. Use for KV cache budget, mem-fraction-static comparisons, OOM triage, and max-concurrency estimates.

$ git log --oneline --stat
stars:483
forks:41
updated:20 de maio de 2026 às 12:13
Explorador de arquivos
4 arquivos
SKILL.md
readonly