kubernetes
Diagnose and fix Kubernetes pods, services, networking, storage, and rollout failures with kubectl. Safety-first, read-only by default.
| name | kubernetes |
| description | Diagnose and fix Kubernetes pods, services, networking, storage, and rollout failures with kubectl. Safety-first, read-only by default. |
| tools | [{"type":"shell","allowed_commands":["kubectl","helm"],"require_confirmation":false,"timeout_seconds":30,"max_output_bytes":102400},{"type":"think"},{"type":"datetime"}] |
| requires | {"bins":["kubectl"]} |
You have Kubernetes diagnostic capabilities via kubectl and helm.
Use these tools when the user asks about:
Follow this six-step process for every issue:
Confirm cluster access and the active context before running any other commands:
kubectl config current-context
kubectl cluster-info
If the user has not specified a namespace, ask for one or default to the namespace set on the current context. Always confirm the target cluster before making changes.
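As a minimal sketch of this pre-flight step, the helpers below print the active context and resolve the namespace configured on it, falling back to `default` when none is set. The function names `current_namespace` and `preflight` are illustrative assumptions and are not part of the skill itself.

```shell
#!/usr/bin/env bash
# Hypothetical helpers (not part of the skill definition).

# Print the namespace set on the current context, or "default" if unset.
current_namespace() {
  local ns
  ns="$(kubectl config view --minify --output 'jsonpath={..namespace}')"
  echo "${ns:-default}"
}

# Show which cluster and namespace subsequent commands would target.
preflight() {
  echo "context:   $(kubectl config current-context)"
  echo "namespace: $(current_namespace)"
}
```

Running `preflight` first gives the user a chance to spot a wrong cluster before any diagnostic or mutating command is issued.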
Classify the issue into one of these layers:
| Layer | Symptoms |
|---|---|
| Pod | CrashLoopBackOff, ImagePullBackOff, OOMKilled, exit codes |
| Service/Network | Connection refused, DNS failures, endpoint mismatches |
| Node/Scheduling | Pending pods, taints, resource pressure |
| Storage | PVC pending, mount errors, capacity |
| Config | ConfigMap/Secret not found, env var issues |
| Rollout | Stuck deployments, replica mismatches |
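The table can be read as a lookup from symptom to layer. The sketch below encodes it as a shell `case` statement; the function name `classify_layer`, the keyword patterns, and the `Unknown` fallback are assumptions chosen for illustration, and real symptoms will often need human judgment rather than keyword matching.

```shell
#!/usr/bin/env bash
# Hypothetical helper: map a free-text symptom to a layer from the table.
# Matching is case-insensitive; more specific layers are checked first so
# that "PVC pending" lands in Storage rather than Node/Scheduling.
classify_layer() {
  local symptom
  symptom="$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')"
  case "$symptom" in
    *crashloopbackoff*|*imagepullbackoff*|*oomkilled*|*"exit code"*)
      echo "Pod" ;;
    *pvc*|*mount*|*capacity*)
      echo "Storage" ;;
    *configmap*|*secret*|*"env var"*)
      echo "Config" ;;
    *rollout*|*replica*)
      echo "Rollout" ;;
    *"connection refused"*|*dns*|*endpoint*)
      echo "Service/Network" ;;
    *pending*|*taint*|*pressure*)
      echo "Node/Scheduling" ;;
    *)
      echo "Unknown" ;;
  esac
}
```

For example, `classify_layer "pod stuck in CrashLoopBackOff"` selects the Pod layer, which then determines which evidence to collect.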
Collect evidence using kubectl. Start broad, then narrow:
1. kubectl get pods -n <ns>, kubectl get events -n <ns> --sort-by=.lastTimestamp
2. kubectl describe pod <name> -n <ns>, kubectl logs <name> -n <ns> --tail=100
3. kubectl get pod <name> -n <ns> -o yaml, container-level logs with -c

Use the think tool to reason through the evidence before concluding.
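The broad-to-narrow sequence above can be bundled into one read-only helper. This is a sketch under the assumption that a single pod is being investigated; `gather_evidence` and its section labels are hypothetical names, and every command it runs is a read operation taken from the list above.

```shell
#!/usr/bin/env bash
# Hypothetical helper: collect evidence from broad to narrow for one pod.
# Usage: gather_evidence <namespace> <pod-name>
gather_evidence() {
  local ns="$1" pod="$2"
  echo "== pods =="
  kubectl get pods -n "$ns"
  echo "== events =="
  kubectl get events -n "$ns" --sort-by=.lastTimestamp
  echo "== describe =="
  kubectl describe pod "$pod" -n "$ns"
  echo "== logs (last 100 lines) =="
  kubectl logs "$pod" -n "$ns" --tail=100
  echo "== full spec =="
  kubectl get pod "$pod" -n "$ns" -o yaml
}
```

Redirecting the output, for example `gather_evidence <ns> <name> > evidence.txt`, keeps a snapshot to reason over before any fix is proposed.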
Consult reference materials before proposing fixes. If the agent has filesystem access to reference docs, read them for symptom-to-fix lookups and decision tree guidance.
Propose a fix with:
Safety rules:
- Do not run kubectl delete namespace or kubectl delete --all without explicit confirmation
- Run kubectl get <resource> <name> -n <ns> -o yaml before patching
After applying a fix:
kubectl get events -n <ns> --sort-by=.lastTimestamp | tail -20
kubectl get pods -n <ns> -o wide
kubectl describe pod <name> -n <ns>
kubectl logs <name> -n <ns> --tail=100
kubectl logs <name> -n <ns> --previous
kubectl top pod -n <ns>
kubectl get svc -n <ns>
kubectl get endpoints <svc> -n <ns>
kubectl describe svc <svc> -n <ns>
kubectl get ingress -n <ns>
kubectl get networkpolicies -n <ns>
kubectl top nodes
kubectl describe node <name>
kubectl get resourcequotas -n <ns>
kubectl get limitranges -n <ns>
kubectl rollout undo deployment/<name> -n <ns>
kubectl scale deployment/<name> --replicas=<n> -n <ns>
kubectl cordon <node>
kubectl drain <node> --ignore-daemonsets --delete-emptydir-data
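The last four commands change cluster state, so it can help to route them through a confirmation step that also restates the target context. The wrapper below is a sketch only: `confirm_kubectl` is a hypothetical name, and requiring a typed "yes" is one possible policy, not something the skill prescribes.

```shell
#!/usr/bin/env bash
# Hypothetical wrapper: show the target context and require a typed "yes"
# before running a state-changing kubectl command.
# Usage: confirm_kubectl <kubectl arguments...>
confirm_kubectl() {
  local ctx answer
  ctx="$(kubectl config current-context)"
  echo "About to run on context '$ctx': kubectl $*"
  printf 'Type "yes" to continue: '
  read -r answer
  if [ "$answer" = "yes" ]; then
    kubectl "$@"
  else
    echo "Aborted."
    return 1
  fi
}
```

A rollback would then be issued as `confirm_kubectl rollout undo deployment/<name> -n <ns>`, and any answer other than "yes" leaves the cluster untouched.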
An issue is resolved when: