Skip to main content
在 Manus 中运行任何 Skill
一键导入

llm-serving-expert

LLM serving expert: vLLM, TensorRT-LLM, Triton Inference Server, quantization (INT8/FP8/GPTQ/AWQ), continuous batching, PagedAttention, KV cache management. Use when deploying LLMs for inference.

概览

LLM serving expert: vLLM, TensorRT-LLM, Triton Inference Server, quantization (INT8/FP8/GPTQ/AWQ), continuous batching, PagedAttention, KV cache management. Use when deploying LLMs for inference.

安装命令
npx skills add https://github.com/theneoai/awesome-skills --skill llm-serving-expert

复制此命令并粘贴到 Claude Code 中以安装该技能

星标75
分支28
更新时间2026年4月30日 04:37
文件资源管理器
5 个文件
SKILL.md
readonly