add-eval
Add or update an LLM eval test for a specific tool or collection. Creates a new eval scenario or updates an existing one to cover new tools. Use when adding eval coverage for new or modified tools.
| name | add-eval |
| description | Add or update an LLM eval test for a specific tool or collection. Creates a new eval scenario or updates an existing one to cover new tools. Use when adding eval coverage for new or modified tools. |
| user_invocable | true |
Add or update an LLM eval test for a specific tool or collection. This skill creates a new eval scenario or updates an existing one to cover new tools.
Use this skill when:
- You have run `/add-tool` and want eval coverage
- `/build-evals` skipped the collection because eval files already exist

Before running, ensure:
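- The collection exists (`src/umbraco-api/tools/{collection}/index.ts`)
- The project builds (`npm run build`)
- The API key is set (`ANTHROPIC_API_KEY`)

The skill takes:

- A collection name (e.g. `/add-eval form`)
- An optional hint (e.g. `/add-eval form "copy form workflow"`)

If no hint is provided, compare the tools in the collection against existing eval test coverage and suggest what's missing.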
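The comparison described above amounts to a set difference. The following is a minimal sketch, not the skill's actual implementation: the `EvalScenario` shape, the `findUncoveredTools` helper, and the tool names are hypothetical, and it assumes each eval file's `tools` and `requiredTools` arrays have already been read into memory.

```typescript
// Hypothetical shape: one entry per existing eval test file.
interface EvalScenario {
  file: string;
  tools: string[];
  requiredTools: string[];
}

// Return the collection tools that no eval scenario lists, preserving input order.
function findUncoveredTools(
  collectionTools: string[],
  scenarios: EvalScenario[],
): string[] {
  const covered = new Set<string>();
  for (const scenario of scenarios) {
    for (const tool of [...scenario.tools, ...scenario.requiredTools]) {
      covered.add(tool);
    }
  }
  return collectionTools.filter((tool) => !covered.has(tool));
}

// Example with made-up tool names.
const uncovered = findUncoveredTools(
  ["get-form", "create-form", "copy-form"],
  [
    {
      file: "tests/evals/form-crud.test.ts",
      tools: ["get-form", "create-form"],
      requiredTools: ["create-form"],
    },
  ],
);
console.log(uncovered); // [ 'copy-form' ]
```

Anything returned by such a helper is a candidate for the "suggest what's missing" step.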
| Agent | When to use |
|---|---|
| eval-test-creator | Creating or updating eval test files (Step 3 or 4) |
- BUILD BEFORE RUNNING. Eval tests run against `dist/index.js`. Always run `npm run build` first.
- RUN COMMANDS SEPARATELY. Always run build and test as separate Bash calls. Never chain them with `&&`.
- ITERATE ON PROMPTS. Eval tests are probabilistic. If a test fails, the fix is usually in the prompt.
- VERBOSE DURING DEVELOPMENT. Always set `verbose: true` when creating or debugging a test.
Check if the eval setup exists:
- `tests/evals/helpers/e2e-setup.ts`
- `tests/evals/jest.config.ts`

If the setup doesn't exist, tell the user to run `/build-evals {collection}` first to create the infrastructure. This skill does not create eval setup files.
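The setup check can be sketched with Node's standard `fs` and `path` modules. This is an illustration only: the `missingEvalSetup` helper is hypothetical, and it assumes the two paths above are relative to the project root.

```typescript
import { existsSync } from "node:fs";
import { join } from "node:path";

// The two setup files named above, assumed relative to the project root.
const EVAL_SETUP_FILES = [
  "tests/evals/helpers/e2e-setup.ts",
  "tests/evals/jest.config.ts",
];

// Return the setup files missing under projectRoot; an empty array means the setup exists.
function missingEvalSetup(projectRoot: string): string[] {
  return EVAL_SETUP_FILES.filter((rel) => !existsSync(join(projectRoot, rel)));
}

const missing = missingEvalSetup(process.cwd());
if (missing.length > 0) {
  console.log(
    `Eval setup incomplete (missing: ${missing.join(", ")}). ` +
      "Run /build-evals {collection} first.",
  );
}
```

The sketch only reports what is missing; consistent with the rule above, it never creates the files itself.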
Read the collection's tools and existing eval tests:
- `src/umbraco-api/tools/{collection}/index.ts`: all tools in the collection
- `tests/evals/{collection}-*.test.ts`: existing eval scenarios

Build an inventory:
- Which tools the existing evals already cover (check the `tools` and `requiredTools` arrays)

Based on the inventory and the user's request:
Update an existing scenario when:
- The new tool fits an existing workflow (e.g. adding `copy-form` to a CRUD scenario)

Create a new scenario when:
If updating an existing eval test:
- Add the new tool to the `COLLECTION_TOOLS` array
- Add it to `requiredTools` if it must be called

Build and run:
npm run build
npm run test:evals -- --testPathPattern="{collection}"
If the test fails, iterate on the prompt. Common fixes:
Use the eval-test-creator agent to create tests/evals/{collection}-{workflow}.test.ts. The agent owns the test template and patterns.
Build and run:
npm run build
npm run test:evals -- --testPathPattern="{collection}-{workflow}"
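Jest treats the `--testPathPattern` value as a regular expression matched against test file paths, so `{collection}` selects every eval file for the collection while `{collection}-{workflow}` narrows the run to one scenario. The sketch below approximates that matching; the `selectTests` helper and the file names are hypothetical.

```typescript
// Approximation of Jest's path filtering: keep paths the pattern (a regex) matches.
function selectTests(pattern: string, paths: string[]): string[] {
  const regex = new RegExp(pattern);
  return paths.filter((p) => regex.test(p));
}

// Made-up eval files for a "form" collection plus one unrelated collection.
const evalFiles = [
  "tests/evals/form-crud.test.ts",
  "tests/evals/form-copy.test.ts",
  "tests/evals/document-crud.test.ts",
];

console.log(selectTests("form", evalFiles).length); // 2
console.log(selectTests("form-copy", evalFiles)); // [ 'tests/evals/form-copy.test.ts' ]
```

Because the match is a substring-style regex test, a short collection name can also match other collections whose file names contain it, which is one reason to prefer the narrower pattern while iterating on a single scenario.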
Iterate on the prompt until the test passes reliably.
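Since eval tests are probabilistic, one way to judge "reliably" is to repeat a run and look at the pass rate. The sketch below is an assumption about how that could be measured, not part of the skill: the `Trial` type and `passRate` helper are hypothetical, and the stand-in trial replaces what would in practice be a real eval test run.

```typescript
// Hypothetical trial: resolves to true when one eval run passes.
type Trial = () => Promise<boolean>;

// Run the trial `runs` times sequentially and return the fraction that passed.
async function passRate(trial: Trial, runs: number): Promise<number> {
  if (runs <= 0) throw new Error("runs must be positive");
  let passed = 0;
  for (let i = 0; i < runs; i++) {
    if (await trial()) passed++;
  }
  return passed / runs;
}

// Stand-in trial that fails every third run; a real one would invoke the eval test.
let attempt = 0;
const flakyTrial: Trial = async () => ++attempt % 3 !== 0;

passRate(flakyTrial, 6).then((rate) => console.log(rate.toFixed(2))); // 0.67
```

What pass rate counts as reliable is a project decision; the source does not define a threshold.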
Report what was done: