nat-evaluation

Name: Nat Evaluation
Author: NVIDIA

// Use when designing, configuring, running, or troubleshooting NeMo Agent Toolkit evaluations, datasets, evaluator selection, ATIF surfaces, quality gates, custom evaluators, and `nat eval`.

Ejecutar en Manus

$ git log --oneline --stat

stars:2319

forks:654

updated:13 de mayo de 2026, 21:15

Explorador de archivos

13 archivos

SKILL.md

readonly

name	nat-evaluation
description	Use when designing, configuring, running, or troubleshooting NeMo Agent Toolkit evaluations, datasets, evaluator selection, ATIF surfaces, quality gates, custom evaluators, and `nat eval`.
author	NVIDIA Corporation and Affiliates
license	Apache-2.0

NeMo Agent Toolkit Evaluation

Use this skill for measuring agent quality and behavior.

Workflow

Decide the evaluation surface and output format.
Decompose quality goals into separate evaluators.
Choose built-in evaluators before writing custom evaluators.
Keep datasets small and explicit for local validation.
Run nat eval and inspect generated artifacts.

References

references/operating-mode.md
references/methodology.md
references/agent-eval-framework.md
references/evaluation-surfaces.md
references/evaluation-contract.md
references/evaluators/
references/code-patterns.md

related-skills.json

mismo repositorio

skill-evolution.md

from "NVIDIA/NeMo-Agent-Toolkit"

Use before creating, editing, or deciding whether to update any AI coding agent skill in this repository, including corrections to existing skill behavior, references, or routing.

2026-05-192.3k

nat-agent-configuration.md

from "NVIDIA/NeMo-Agent-Toolkit"

Use when selecting, configuring, composing, or troubleshooting NeMo Agent Toolkit agents and control-flow components, including ReAct, tool-calling, ReWOO, reasoning, router, sequential, parallel, and sub-agent patterns.

2026-05-132.3k

nat-installation.md

from "NVIDIA/NeMo-Agent-Toolkit"

Use when installing or configuring NVIDIA NeMo Agent Toolkit, verifying the `nat` CLI, setting up optional extras, or creating a first hello-world workflow.

2026-05-132.3k

nat-mcp-and-serving.md

from "NVIDIA/NeMo-Agent-Toolkit"

Use when serving NeMo Agent Toolkit workflows, exposing workflows through FastAPI, configuring MCP clients or servers, or troubleshooting transport and server setup.

2026-05-132.3k

nat-optimization.md

from "NVIDIA/NeMo-Agent-Toolkit"

Use when configuring or running NeMo Agent Toolkit optimization with `nat optimize`, including Optuna parameter tuning, prompt evolution, optimizer sizing, output interpretation, and optimizer datasets.

2026-05-132.3k

nat-path-checks.md

from "NVIDIA/NeMo-Agent-Toolkit"

Use when fixing NeMo Agent Toolkit documentation path-check failures, especially failed `ci/scripts/path_checks.py` output, slash-delimited text mistaken for paths, relative path references, Markdown code escaping, and path-check allowlist decisions.

2026-05-132.3k

package.json