Skip to main content
Run any Skill in Manus
with one click
$pwd:

hipfire-kernel-tuning

// Optimize hipfire HIP/compute kernels — pick a tuning lever (multi-row, K-tile depth, prefetch, wave-size port, WMMA/MFMA, fused projections, ISA flags) and validate the win across the supported RDNA arch matrix. Use when you've identified a hot kernel, want to land a real perf win, and need to NOT regress on archs you don't have hardware for. Codifies the methodology from this repo's actual perf history — wave64 CDNA3 port (commit 4105035, 2× decode), nontemporal-load revert (34eb024, -13% caught only by clean-baseline bisect), gfx12 WMMA port (PR

$ git log --oneline --stat
stars:411
forks:44
updated:May 18, 2026 at 23:49
File Explorer
6 files
SKILL.md
readonly