
cuda-kernels

// Provides guidance for writing and benchmarking optimized CUDA kernels for NVIDIA GPUs (H100, A100, T4) targeting HuggingFace diffusers and transformers libraries. Supports models like LTX-Video, Stable Diffusion, LLaMA, Mistral, Qwen, and Qwen3. Includes integration with HuggingFace Kernels Hub (get_kernel) for loading pre-compiled kernels. Includes benchmarking scripts to compare kernel performance against baseline implementations.

$ git log --oneline --stat
stars: 16
forks: 2
updated: February 11, 2026 at 11:29
File explorer
15 files
SKILL.md
readonly