Skip to main content
Run any Skill in Manus
with one click

vllm-setup

Deploy a vLLM inference server on an NVIDIA DGX Station GB300 with validated container, GPU targeting, and tuning parameters. Use when the user asks to serve a model with vLLM, start a vLLM endpoint, or set up OpenAI-compatible inference on DGX Station.

Skill metadata
Stars918
Forks218
UpdatedMay 30, 2026 at 11:49
SKILL.md
readonly