Skip to main content
Run any Skill in Manus
with one click

sglang-setup

Deploy an SGLang inference server on an NVIDIA DGX Station GB300 with the cu130 container, RadixAttention prefix caching, and structured JSON output support. Use when the user asks to serve a model with SGLang, start an SGLang endpoint, or needs structured-output inference on DGX Station.

Skill metadata
Stars918
Forks218
UpdatedMay 30, 2026 at 11:49
SKILL.md
readonly