$pwd:

add-inference-backend

// Add a new hardware inference backend to AutoRound for deploying quantized models (e.g., CUDA/Marlin, Triton, CPU, HPU, ARK). Use when implementing QuantLinear kernels, registering backend capabilities, or enabling quantized model inference on a new hardware platform.

$ git log --oneline --stat
stars:1,425
forks:134
updated:May 11, 2026 at 02:30
SKILL.md
readonly