$pwd:

hyperpod-cluster-debugger

// Diagnose and remediate cluster-wide HyperPod (EKS or Slurm) problems: creation and deployment failures (CloudFormation, EFA health check, lifecycle scripts, capacity), EKS access, node replacement, CloudFormation nested-stack errors, post-maintenance rollback state, dangling nodes, and autoscaler conflicts. Includes a `--validate` pre-flight check. Read-only.

$ git log --oneline --stat
stars:765
forks:107
updated: May 16, 2026, 23:28
File explorer
8 files
SKILL.md
readonly