Skip to main content
تشغيل أي مهارة في Manus
بنقرة واحدة
$pwd:

vllm-bench-serve

// Benchmark vLLM or OpenAI-compatible serving endpoints using vllm bench serve. Supports multiple datasets (random, sharegpt, sonnet, HF), backends (openai, openai-chat, vllm-pooling, embeddings), throughput/latency testing with request-rate control, and result saving. Use when benchmarking LLM serving performance, measuring TTFT/TPOT, or load testing inference APIs.

$ git log --oneline --stat
stars:٧٦
forks:٢٢
updated:٣ أبريل ٢٠٢٦ في ١٤:١١
SKILL.md
readonly