Skip to main content
Manus에서 모든 스킬 실행
원클릭으로
$pwd:

vllm-bench-serve

// Benchmark vLLM or OpenAI-compatible serving endpoints using vllm bench serve. Supports multiple datasets (random, sharegpt, sonnet, HF), backends (openai, openai-chat, vllm-pooling, embeddings), throughput/latency testing with request-rate control, and result saving. Use when benchmarking LLM serving performance, measuring TTFT/TPOT, or load testing inference APIs.

$ git log --oneline --stat
stars:76
forks:22
updated:2026년 4월 3일 14:11
SKILL.md
readonly