llm-serving-expert

LLM serving expert: vLLM, TensorRT-LLM, Triton Inference Server, quantization (INT8/FP8/GPTQ/AWQ), continuous batching, PagedAttention, KV cache management. Use when deploying LLMs for inference.

Install command
npx skills add https://github.com/theneoai/awesome-skills --skill llm-serving-expert

Copy and paste this command into Claude Code to install the skill

Stars: 75
Forks: 28
Updated: April 30, 2026 at 04:37