A comprehensive analysis skill for summarizing HAMi vGPU metrics from Prometheus-style `/metrics` output. It organizes GPU allocation by node, device, pod, and namespace, and produces clear reports covering vGPU core allocation, memory allocation, allocation-based utilization, sharing density, and namespace-level usage patterns.

2026-05-08

k8s-debug-pending-pod

Netzwerk- und Computersystemadministratoren

Use when pods are stuck in Pending, CrashLoopBackOff, or ImagePullBackOff state. Performs event-driven triage to quickly identify root cause, then deep-dives into scheduling failures, resource exhaustion, image pull errors, and crash loops.

2026-05-08

#002

k8s-dra-driver

1 skills2110updated 2026-05-09

33% of creator

skill

Beruf

description

updated

hami-dra-kind-testing

Softwarequalitätssicherungsanalysten und -tester

Use when testing the HAMi-Core DRA Driver on a kind cluster — covers cluster setup, Helm-based driver install, ResourceClaim configuration, pod scheduling, HAMi-Core memory limit verification via nvidia-smi, and teardown.

2026-05-09

2 von 2 Repositories angezeigt

Alle Repositories angezeigt