Skip to main content
Ejecuta cualquier Skill en Manus
con un clic

hyperpod-cluster-debugger

// Diagnose and remediate cluster-wide HyperPod (EKS or Slurm) problems — creation / deployment failures (CloudFormation, EFA health check, lifecycle scripts, capacity), EKS access, node replacement, CloudFormation nested-stack errors, post-maintenance rollback state, dangling nodes, autoscaler conflicts. Includes `--validate` pre-flight. Read-only.

$ git log --oneline --stat
stars:765
forks:107
updated:16 de mayo de 2026, 23:28
Explorador de archivos
8 archivos
SKILL.md
readonly