Skip to main content
Jeden Skill in Manus ausführen
mit einem Klick
$pwd:

add-inference-backend

// Add a new hardware inference backend to AutoRound for deploying quantized models (e.g., CUDA/Marlin, Triton, CPU, HPU, ARK). Use when implementing QuantLinear kernels, registering backend capabilities, or enabling quantized model inference on a new hardware platform.

$ git log --oneline --stat
stars:1.425
forks:134
updated:11. Mai 2026 um 02:30
SKILL.md
readonly