Skip to main content
Run any Skill in Manus
with one click
$pwd:

add-inference-backend

// Add a new hardware inference backend to AutoRound for deploying quantized models (e.g., CUDA/Marlin, Triton, CPU, HPU, ARK). Use when implementing QuantLinear kernels, registering backend capabilities, or enabling quantized model inference on a new hardware platform.

$ git log --oneline --stat
stars:1,425
forks:134
updated:May 11, 2026 at 02:30
SKILL.md
readonly