Skip to main content
Execute qualquer Skill no Manus
com um clique

llm-serving-expert

LLM serving expert: vLLM, TensorRT-LLM, Triton Inference Server, quantization (INT8/FP8/GPTQ/AWQ), continuous batching, PagedAttention, KV cache management. Use when deploying LLMs for inference.

Visão geral

LLM serving expert: vLLM, TensorRT-LLM, Triton Inference Server, quantization (INT8/FP8/GPTQ/AWQ), continuous batching, PagedAttention, KV cache management. Use when deploying LLMs for inference.

Comando de instalação
npx skills add https://github.com/theneoai/awesome-skills --skill llm-serving-expert

Copie e cole este comando no Claude Code para instalar a skill

Estrelas75
Forks28
Atualizado30 de abril de 2026 às 04:37
Explorador de arquivos
5 arquivos
SKILL.md
readonly