Skip to main content
Ejecuta cualquier Skill en Manus
con un clic

llm-serving-expert

LLM serving expert: vLLM, TensorRT-LLM, Triton Inference Server, quantization (INT8/FP8/GPTQ/AWQ), continuous batching, PagedAttention, KV cache management. Use when deploying LLMs for inference.

Resumen

LLM serving expert: vLLM, TensorRT-LLM, Triton Inference Server, quantization (INT8/FP8/GPTQ/AWQ), continuous batching, PagedAttention, KV cache management. Use when deploying LLMs for inference.

Comando de instalación
npx skills add https://github.com/theneoai/awesome-skills --skill llm-serving-expert

Copia y pega este comando en Claude Code para instalar la habilidad

Estrellas75
Forks28
Actualizado30 de abril de 2026, 04:37
Explorador de archivos
5 archivos
SKILL.md
readonly