with one click
llm-inference-optimization
Quantization, caching, batching, and serving optimization for LLM inference.
Install with Codex or Claude Copy this prompt, paste it into Codex, Claude, or another assistant, and let it review the skill page and install it for you.