Jeden Skill in Manus ausführen
mit einem Klick

Jeden Skill in Manus mit einem Klick ausführen

$pwd:

k8s-launch-kit-discover

Name: K8s Launch Kit Discover
Author: NVIDIA

// Use this skill when the user wants to discover their Kubernetes cluster's network hardware capabilities using k8s-launch-kit (l8k). Activate for: cluster discovery, hardware detection, NIC detection, finding what GPUs or NICs are in a cluster, creating a cluster config file, or when the user says 'discover' in the context of l8k or NVIDIA networking.

In Manus ausführen

$ git log --oneline --stat

stars:11

forks:3

updated:7. Mai 2026 um 08:33

Datei-Explorer

2 Dateien

SKILL.md

readonly

name	k8s-launch-kit-discover
version	1.1.0
description	Use this skill when the user wants to discover their Kubernetes cluster's network hardware capabilities using k8s-launch-kit (l8k). Activate for: cluster discovery, hardware detection, NIC detection, finding what GPUs or NICs are in a cluster, creating a cluster config file, or when the user says 'discover' in the context of l8k or NVIDIA networking.
metadata	{"requires":{"skills":["k8s-launch-kit-shared"]}}

l8k: Cluster Discovery

PREREQUISITE: Read ../k8s-launch-kit-shared/SKILL.md for install paths, global flags, and output modes.

Discover cluster hardware and produce a cluster-config.yaml describing NICs, GPUs, rails, and node groups.

Usage (from AI agent)

l8k discover \
  --kubeconfig ~/.kube/config \
  --save-cluster-config ./cluster-config.yaml \
  --output json 2>/dev/null | jq .

Usage (human-interactive)

l8k discover --save-cluster-config <OUTPUT> [--kubeconfig <PATH>]

Flags

Flag	Required	Default	Description
`--kubeconfig`	—	`$KUBECONFIG` env var	Path to kubeconfig (optional — falls back to env var)
`--save-cluster-config`	Yes	—	Output path for cluster-config.yaml
`--network-operator-namespace`	—	`nvidia-network-operator`	Override operator namespace
`--user-config`	—	—	Base config to merge with discovered hardware
`--node-selector`	—	—	Restrict to matching nodes
`--image-pull-secrets`	—	—	Image pull secret names for NicClusterPolicy (comma-separated)

Examples

# Basic discovery
l8k discover \
  --kubeconfig ~/.kube/config \
  --save-cluster-config ./cluster-config.yaml

# Using $KUBECONFIG env var (no --kubeconfig needed)
l8k discover --save-cluster-config ./cluster-config.yaml

# Non-default operator namespace
l8k discover \
  --kubeconfig ~/.kube/config \
  --network-operator-namespace network-operator \
  --save-cluster-config ./cluster-config.yaml

# Merge with existing config
l8k discover --user-config my-config.yaml \
  --kubeconfig ~/.kube/config \
  --save-cluster-config ./cluster-config.yaml

# Agent mode (JSON output)
l8k discover \
  --kubeconfig ~/.kube/config \
  --save-cluster-config ./cluster-config.yaml \
  --output json 2>/dev/null

Output Format

The generated cluster-config.yaml contains a clusterConfig[] array. Each element is a hardware group:

clusterConfig:
  - identifier: "dgx-b200-nvidia-h100-nvl"
    machineType: DGX-B200
    gpuType: NVIDIA-H100-NVL
    capabilities:
      nodes:
        sriov: true
        rdma: true
    pfs:
      - deviceID: "101e"
        networkInterface: "eth0"
        rail: 0
    workerNodes: [node-01, node-02]
    nodeSelector:
      nvidia.kubernetes-launch-kit.machine: "DGX-B200-NVIDIA-H100-NVL"
    thirdPartyRDMAModules: [nv_peer_mem]

Discovery patches every node in the group with two labels:

nvidia.kubernetes-launch-kit.machine: <machineType>-<gpuType> — per-source-group identity, used as the source group's nodeSelector.
nvidia.kubernetes-launch-kit.gpu: <gpuType> — used as the merged-group nodeSelector when l8k generate auto-merges source groups sharing a GPU type.

Label values keep their original case (matching nvidia.com/gpu.product style) since upstream discovery already trims whitespace and replaces spaces with hyphens. Values that would exceed the Kubernetes 63-char label-value limit are skipped (logged at debug). The group's identifier is the lowercase resource-name form of the machine label (RFC 1123 — required for downstream NicNodePolicy / SriovNetworkNodePolicy naming). When machineType or gpuType couldn't be resolved (GPU operator labels absent and hardware probe failed), a fallback group-N identifier is used and the machine label is not written; the GPU label is still written when gpuType alone is resolved.

Prerequisites

NVIDIA Network Operator Helm chart installed in the cluster
Node Feature Discovery (NFD) active with NodeFeature CRDs populated
Worker nodes with label feature.node.kubernetes.io/pci-15b3.present=true

Tips

If discovery fails with "no pods found for DaemonSet", the error will suggest using --network-operator-namespace. Common namespaces are nvidia-network-operator and network-operator.
Discovery uses server-side apply (field owner l8k-discovery) — it won't conflict with an existing NicClusterPolicy.
After determining each group's (machineType, gpuType), discovery looks up a topology preset under presets/ using exact-match lookup on that pair. A matching preset overrides heuristic-derived topology fields (traffic class, rail, NUMA, GPU affinity). There is no any-GPU fallback — a preset with empty gpuType: is rejected at load time. If no preset matches, discovery proceeds with heuristic classification.
If you already know the SKU and want to skip cluster discovery entirely, use l8k generate --for <preset> (see k8s-launch-kit-generate).

Use this skill when the user wants to generate Kubernetes YAML manifests for NVIDIA networking deployment using k8s-launch-kit (l8k). Activate for: manifest generation, profile selection, choosing between SR-IOV/host-device/RDMA-shared/IPoIB/MacVLAN/Spectrum-X, creating deployment files, or when the user asks 'which profile should I use' or needs help choosing a network configuration.

2026-05-0711

k8s-launch-kit-config.md

from "NVIDIA/k8s-launch-kit"

Use this skill when the user needs help understanding, creating, or editing a k8s-launch-kit (l8k) configuration file (l8k-config.yaml or cluster-config.yaml). Activate for: config file questions, parameter tuning, subnet configuration, NV-IPAM setup, DOCA driver settings, NIC configuration operator settings, changing MTU, VFs, resource names, or understanding what any config field does.

2026-05-0711

k8s-launch-kit-validate.md

from "NVIDIA/k8s-launch-kit"

Use this skill when the user wants to verify that an NVIDIA networking deployment matches the configuration that produced it. Activate for: 'is my deployment correct', 'are all the manifests applied', 'does the network operator version match', 'verify deployment', 'check cluster state against config', or any question about whether the cluster reflects what l8k generated. Wraps the `l8k validate` subcommand.

2026-05-0611

k8s-network-engineer.md

from "NVIDIA/k8s-launch-kit"

Embody a senior NVIDIA Networking Engineer who is an expert on deploying cloud-native networking on Kubernetes with k8s-launch-kit (l8k). Activate whenever the user mentions NVIDIA network profiles, SR-IOV, RDMA, Spectrum-X, BlueField, ConnectX, NIC configuration, Network Operator, DOCA drivers, multirail networking, l8k, k8s-launch-kit, or any Kubernetes networking topic involving NVIDIA hardware. Also activate when the user asks general questions about high-performance networking, GPU interconnect, or RDMA configuration.

2026-05-0611

k8s-launch-kit-deploy.md

from "NVIDIA/k8s-launch-kit"

Use this skill when the user wants to deploy generated NVIDIA networking manifests to a Kubernetes cluster using k8s-launch-kit (l8k). Activate for: applying manifests, deploying to cluster, the `l8k deploy` subcommand or the legacy --deploy flag on `l8k generate`, applying generated files, or any mention of pushing l8k output to a live cluster. Even if the user just says 'apply these' or 'push to cluster' after generating manifests, use this skill.

2026-05-0611

k8s-launch-kit-pipeline.md

from "NVIDIA/k8s-launch-kit"

Use this skill when the user wants to run the full k8s-launch-kit (l8k) pipeline end-to-end: discover cluster hardware, select a profile, generate manifests, and deploy them all in one command. Also activate for CI/CD integration, automation pipelines, 'one-liner', 'complete workflow', or end-to-end NVIDIA networking deployment.

2026-05-0411

package.json

"author": "NVIDIA"

"repository": "NVIDIA/k8s-launch-kit"

GitHub-Repository öffnen Creator-Repositorys ansehen

$ install --global

$ download --local

In Manus ausführen

$ useful --forSOC

Netzwerk- und ComputersystemadministratorenInformatik- und Mathematikberufe15-1244L4

name	k8s-launch-kit-discover
version	1.1.0
description	Use this skill when the user wants to discover their Kubernetes cluster's network hardware capabilities using k8s-launch-kit (l8k). Activate for: cluster discovery, hardware detection, NIC detection, finding what GPUs or NICs are in a cluster, creating a cluster config file, or when the user says 'discover' in the context of l8k or NVIDIA networking.
metadata	{"requires":{"skills":["k8s-launch-kit-shared"]}}

l8k: Cluster Discovery

PREREQUISITE: Read ../k8s-launch-kit-shared/SKILL.md for install paths, global flags, and output modes.

Discover cluster hardware and produce a cluster-config.yaml describing NICs, GPUs, rails, and node groups.

Usage (from AI agent)

l8k discover \
  --kubeconfig ~/.kube/config \
  --save-cluster-config ./cluster-config.yaml \
  --output json 2>/dev/null | jq .

Usage (human-interactive)

l8k discover --save-cluster-config <OUTPUT> [--kubeconfig <PATH>]

Flags

Flag	Required	Default	Description
`--kubeconfig`	—	`$KUBECONFIG` env var	Path to kubeconfig (optional — falls back to env var)
`--save-cluster-config`	Yes	—	Output path for cluster-config.yaml
`--network-operator-namespace`	—	`nvidia-network-operator`	Override operator namespace
`--user-config`	—	—	Base config to merge with discovered hardware
`--node-selector`	—	—	Restrict to matching nodes
`--image-pull-secrets`	—	—	Image pull secret names for NicClusterPolicy (comma-separated)

Examples

# Basic discovery
l8k discover \
  --kubeconfig ~/.kube/config \
  --save-cluster-config ./cluster-config.yaml

# Using $KUBECONFIG env var (no --kubeconfig needed)
l8k discover --save-cluster-config ./cluster-config.yaml

# Non-default operator namespace
l8k discover \
  --kubeconfig ~/.kube/config \
  --network-operator-namespace network-operator \
  --save-cluster-config ./cluster-config.yaml

# Merge with existing config
l8k discover --user-config my-config.yaml \
  --kubeconfig ~/.kube/config \
  --save-cluster-config ./cluster-config.yaml

# Agent mode (JSON output)
l8k discover \
  --kubeconfig ~/.kube/config \
  --save-cluster-config ./cluster-config.yaml \
  --output json 2>/dev/null

Output Format

The generated cluster-config.yaml contains a clusterConfig[] array. Each element is a hardware group:

clusterConfig:
  - identifier: "dgx-b200-nvidia-h100-nvl"
    machineType: DGX-B200
    gpuType: NVIDIA-H100-NVL
    capabilities:
      nodes:
        sriov: true
        rdma: true
    pfs:
      - deviceID: "101e"
        networkInterface: "eth0"
        rail: 0
    workerNodes: [node-01, node-02]
    nodeSelector:
      nvidia.kubernetes-launch-kit.machine: "DGX-B200-NVIDIA-H100-NVL"
    thirdPartyRDMAModules: [nv_peer_mem]

Discovery patches every node in the group with two labels:

nvidia.kubernetes-launch-kit.machine: <machineType>-<gpuType> — per-source-group identity, used as the source group's nodeSelector.
nvidia.kubernetes-launch-kit.gpu: <gpuType> — used as the merged-group nodeSelector when l8k generate auto-merges source groups sharing a GPU type.

Prerequisites

NVIDIA Network Operator Helm chart installed in the cluster
Node Feature Discovery (NFD) active with NodeFeature CRDs populated
Worker nodes with label feature.node.kubernetes.io/pci-15b3.present=true

Tips

If discovery fails with "no pods found for DaemonSet", the error will suggest using --network-operator-namespace. Common namespaces are nvidia-network-operator and network-operator.
Discovery uses server-side apply (field owner l8k-discovery) — it won't conflict with an existing NicClusterPolicy.
After determining each group's (machineType, gpuType), discovery looks up a topology preset under presets/ using exact-match lookup on that pair. A matching preset overrides heuristic-derived topology fields (traffic class, rail, NUMA, GPU affinity). There is no any-GPU fallback — a preset with empty gpuType: is rejected at load time. If no preset matches, discovery proceeds with heuristic classification.
If you already know the SKU and want to skip cluster discovery entirely, use l8k generate --for <preset> (see k8s-launch-kit-generate).

k8s-launch-kit-discover

l8k: Cluster Discovery

Usage (from AI agent)

Usage (human-interactive)

Flags

Examples

Output Format

Prerequisites

Tips

See Also

l8k: Cluster Discovery

Usage (from AI agent)

Usage (human-interactive)

Flags

Examples

Output Format

Prerequisites

Tips

See Also

k8s-launch-kit-discover

l8k: Cluster Discovery

Usage (from AI agent)

Usage (human-interactive)

Flags

Examples

Output Format

Prerequisites

Tips

See Also

Mehr aus diesem Repository

Mehr aus diesem Repository

l8k: Cluster Discovery

Usage (from AI agent)

Usage (human-interactive)

Flags

Examples

Output Format

Prerequisites

Tips

See Also