Skip to main content
Exécutez n'importe quel Skill dans Manus
en un clic

tsdb-diagnosis

// Diagnose training job incidents and check cluster health using the per-job Prometheus TSDB. Use when the user asks to diagnose a failure root cause, check GPU/network health, query Prometheus metrics, investigate a hang, or when the triage skill recommends deeper TSDB analysis.

$ git log --oneline --stat
stars:28
forks:3
updated:3 mai 2026 à 19:33
SKILL.md
readonly