Skip to main content
Manus에서 모든 스킬 실행
원클릭으로
$pwd:

prefetch-data-load

// Apply prefetch optimization to FlyDSL kernel loops: pre-load the first iteration's data before the loop, issue async loads for the next iteration inside the loop body, and swap buffers at the loop tail via runtime loop-carried values. This overlaps data load latency with compute instructions. Use when a kernel has a loop where buffer_load feeds into MFMA/compute and load latency is exposed. Usage: /prefetch-data-load

$ git log --oneline --stat
stars:192
forks:56
updated:2026년 5월 29일 07:14
SKILL.md
readonly