llm-inference-optimization
Quantization, caching, batching, and serving optimization for LLM inference.
Based on the SOC occupational classification
| name | LLM Inference Optimization |
| description | Quantization, caching, batching, and serving optimization for LLM inference. |
Quantization, caching, batching, and serving optimization for LLM inference.
Use this skill when working on AI engineering tasks related to LLM inference optimization.
Affiliate program strategy, link optimization, and commission maximization.
Code generation with LLMs, code review automation, and AI pair programming.
RLHF, constitutional AI, safety evaluation, and alignment techniques.
Astro static site generation, islands architecture, and content collections.
Logo design, brand guidelines, and visual identity systems.
Compelling case studies that showcase results and drive conversions.