$pwd:

hyperpod-cluster-debugger

// Diagnose and remediate cluster-wide HyperPod (EKS or Slurm) problems: creation and deployment failures (CloudFormation, EFA health check, lifecycle scripts, capacity), EKS access, node replacement, CloudFormation nested-stack errors, post-maintenance rollback state, dangling nodes, and autoscaler conflicts. Includes a `--validate` pre-flight check. Read-only.

$ git log --oneline --stat
stars:765
forks:107
updated:2026-05-16 23:28
File explorer
8 files
SKILL.md
readonly