$pwd:

hipfire-kernel-tuning

// Optimize hipfire HIP/compute kernels — pick a tuning lever (multi-row, K-tile depth, prefetch, wave-size port, WMMA/MFMA, fused projections, ISA flags) and validate the win across the supported RDNA arch matrix. Use when you've identified a hot kernel, want to land a real perf win, and need to NOT regress on archs you don't have hardware for. Codifies the methodology from this repo's actual perf history — wave64 CDNA3 port (commit 4105035, 2× decode), nontemporal-load revert (34eb024, -13% caught only by clean-baseline bisect), gfx12 WMMA port (PR

$ git log --oneline --stat
stars:411
forks:44
updated: May 18, 2026 at 23:49
File explorer
6 files
SKILL.md
readonly