$pwd:

profile-drill

// Direct per-kernel time analysis from JAX / TensorFlow xplane traces via `utils/profile_drill.py`. Use when the user asks for a per-kernel breakdown, step-time composition, cross-variant kernel comparison, main-stream-blocking analysis, or any question that needs ground-truth kernel timings below what TraceLens reports. Triggers include "xplane", "trace.json.gz", "input_scatter_fusion", "RaggedAllToAllKernelImpl", "ncclDevKernel", "step − total kernel", "main-stream-busy", "profile drill-down", or suspicion that TraceLens numbers are off by ~1.5–2×.
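As a rough illustration of what a per-kernel breakdown involves, the sketch below sums event durations by name from a `trace.json.gz` file. It is not the implementation of `utils/profile_drill.py`. It assumes the trace uses the Chrome trace-event JSON layout (a top-level `traceEvents` list whose complete events carry `"ph": "X"`, a `name`, and a `dur` in microseconds). The function names `summarize_events` and `load_trace_events` are hypothetical.

```python
import gzip
import json
from collections import defaultdict


def summarize_events(events):
    """Return (name, total_dur_us, count) rows, largest total first.

    Only complete events ("ph" == "X") that carry a "dur" field are counted.
    """
    totals = defaultdict(lambda: [0.0, 0])  # name -> [total_us, count]
    for ev in events:
        if ev.get("ph") != "X":
            continue  # skip metadata, counters, and begin/end pairs
        dur = ev.get("dur")
        if dur is None:
            continue
        entry = totals[ev.get("name", "<unnamed>")]
        entry[0] += float(dur)
        entry[1] += 1
    rows = [(name, total, count) for name, (total, count) in totals.items()]
    rows.sort(key=lambda row: row[1], reverse=True)
    return rows


def load_trace_events(path):
    """Load the event list from a (optionally gzipped) trace JSON file."""
    opener = gzip.open if str(path).endswith(".gz") else open
    with opener(path, "rt", encoding="utf-8") as fh:
        trace = json.load(fh)
    # Assumption: events live under "traceEvents"; a bare list is also accepted.
    if isinstance(trace, list):
        return trace
    return trace.get("traceEvents", [])


if __name__ == "__main__":
    import sys

    for name, total_us, count in summarize_events(load_trace_events(sys.argv[1]))[:20]:
        print(f"{total_us / 1e3:12.3f} ms  x{count:<6d} {name}")
```

This sums every complete event, host-side ones included, and does not separate streams. Kernels that overlap on different streams are counted in full, so the per-name totals can add up to more than the wall-clock step time. Filtering by `pid`/`tid` to isolate a single device stream would be the natural next step for a main-stream-busy figure.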

$ git log --oneline --stat
stars: 28
forks: 3
updated: May 9, 2026 at 21:10
SKILL.md