Skip to main content
Execute qualquer Skill no Manus
com um clique
$pwd:

vllm-bench-serve

// Benchmark vLLM or OpenAI-compatible serving endpoints using vllm bench serve. Supports multiple datasets (random, sharegpt, sonnet, HF), backends (openai, openai-chat, vllm-pooling, embeddings), throughput/latency testing with request-rate control, and result saving. Use when benchmarking LLM serving performance, measuring TTFT/TPOT, or load testing inference APIs.

$ git log --oneline --stat
stars:76
forks:22
updated:3 de abril de 2026 às 14:11
SKILL.md
readonly