一键在 Manus 中运行任何 Skill

$pwd:

test-tracecat-prompts

Name: Test Tracecat Prompts
Author: TracecatHQ

// Use when evaluating Tracecat MCP prompts, automation-authoring instructions, or agent behavior with the local Tracecat prompt eval harness, including static MCP prompt checks, live local workflow-authoring evals, and Codex vs Claude Code performance comparisons.

在 Manus 中运行

$ git log --oneline --stat

stars:3,624

forks:359

updated:2026年5月30日 02:34

文件资源管理器

2 个文件

SKILL.md

readonly

name	test-tracecat-prompts
description	Use when evaluating Tracecat MCP prompts, automation-authoring instructions, or agent behavior with the local Tracecat prompt eval harness, including static MCP prompt checks, live local workflow-authoring evals, and Codex vs Claude Code performance comparisons.

Test Tracecat Prompts

Workflow

Use the local harness at scripts/evals/tracecat_authoring/run_local.py. Keep these evals local-only; do not wire them into CI unless the user explicitly asks.

For prompt/schema regressions that do not need LLM calls, run:

uv run python scripts/evals/tracecat_authoring/run_local.py --static-only

For live generation evals, use the active local cluster MCP URL. Check it with ./scripts/cluster ports and use the MCP: URL. If no local cluster is running, start one with just cluster up -d. Use http://127.0.0.1:8099/mcp only when you are intentionally running the MCP server directly outside the cluster helper.

MCP_URL="$(./scripts/cluster ports | awk '/MCP:/ {print $2}')"
uv run python scripts/evals/tracecat_authoring/run_local.py \
  --mcp-url "$MCP_URL" \
  --cases smoke \
  --agent codex

To compare Claude Code against Codex, run both agents against the same cases:

MCP_URL="$(./scripts/cluster ports | awk '/MCP:/ {print $2}')"
TRACECAT_EVAL_CLAUDE_MODEL=claude-opus-4-7 \
uv run python scripts/evals/tracecat_authoring/run_local.py \
  --mcp-url "$MCP_URL" \
  --cases smoke \
  --agents codex,claude-code

Use TRACECAT_EVAL_WORKSPACE_ID to pin the workspace, TRACECAT_MCP_BEARER_TOKEN when the local MCP server requires auth, TRACECAT_EVAL_MODEL for Codex, and TRACECAT_EVAL_CLAUDE_MODEL for Claude Code.

Interpretation

Static and unit tests are fast because they parse prompts and local schemas only. They do not call live LLMs. Live agent evals are slower, create or edit workflows in the selected local workspace, and spend model credits.

Read .tracecat/evals/tracecat_authoring/<timestamp>/report.md first. The report starts with a performance matrix:

Speed is wall-clock duration per agent run.
Accuracy is the fraction of structural/rubric checks passed.
MCP efficiency and reliability come from on-disk transcripts: total MCP tool calls, successful calls, failed calls, and schema/input failures.
Improvements are derived from failed rubric checks, invocation errors, and failed transcript-derived MCP checks.

Claude Code may return prose instead of strict structured JSON. Inspect transcript.jsonl and final.json under the case artifact directory; the transcript is the source of truth for MCP tool-call behavior.

Guardrails

Keep fixtures synthetic: example.com, placeholder ids, no real people, no real tokens.
Do not treat existing unit tests as live generation coverage.
Do not inspect or score unrelated dirty repo changes while running the evals.
For existing workflow edit cases, expect the agent to use get_workflow, validate-only edit_workflow, then apply the edit using the original draft_revision returned by get_workflow.

related-skills.json

同仓库

tracecat-automation-best-practices.md

from "TracecatHQ/tracecat"

Use when building, editing, validating, or debugging generic Tracecat automations through Tracecat MCP, including workflow DSL/YAML authoring, table design, unique indexes, run-python Tracecat imports, agent presets, ai.agent or ai.preset_agent workflows, executions, validation, and workflow best practices.

2026-05-303.6k

tracecat-slackbot-best-practices.md

from "TracecatHQ/tracecat"

Use when building, editing, validating, or debugging Tracecat Slack bots and Slack-facing automations through Tracecat MCP, including Slack app mentions, interactive messages, event subscriptions, webhooks, thread replies, Slack tools, ai.agent or ai.preset_agent bots, Slack tone, and Slack smoke tests.

2026-05-303.6k

gh-prerelease.md

from "TracecatHQ/tracecat"

Cut a GitHub prerelease off a specific commit — branch, bump version with `just update-version`, tag, push, and publish a prerelease

2026-05-123.6k

docs-authoring.md

from "TracecatHQ/tracecat"

Use when adding or updating documentation pages in an existing docs site. Covers matching nearby docs tone and structure, planning navigation and page content, running product services and docs previews, capturing supporting UI screenshots with Chrome DevTools, suppressing Next.js floating dev indicators before screenshots, taking full-height Mintlify docs screenshots for PR descriptions, keeping PR-only artifacts out of committed files, and creating or updating GitHub PRs with verified documentation labels and screenshot links.

2026-05-063.6k

package.json

"author": "TracecatHQ"

"repository": "TracecatHQ/tracecat"

打开 GitHub 仓库查看创作者相关仓库

$ install --global

$ download --local

在 Manus 中运行

$ useful --forSOC

软件质量保证分析师与测试员计算机与数学类职业15-1253L4

name	test-tracecat-prompts
description	Use when evaluating Tracecat MCP prompts, automation-authoring instructions, or agent behavior with the local Tracecat prompt eval harness, including static MCP prompt checks, live local workflow-authoring evals, and Codex vs Claude Code performance comparisons.

Test Tracecat Prompts

Workflow

Use the local harness at scripts/evals/tracecat_authoring/run_local.py. Keep these evals local-only; do not wire them into CI unless the user explicitly asks.

For prompt/schema regressions that do not need LLM calls, run:

uv run python scripts/evals/tracecat_authoring/run_local.py --static-only

For live generation evals, use the active local cluster MCP URL. Check it with ./scripts/cluster ports and use the MCP: URL. If no local cluster is running, start one with just cluster up -d. Use http://127.0.0.1:8099/mcp only when you are intentionally running the MCP server directly outside the cluster helper.

MCP_URL="$(./scripts/cluster ports | awk '/MCP:/ {print $2}')"
uv run python scripts/evals/tracecat_authoring/run_local.py \
  --mcp-url "$MCP_URL" \
  --cases smoke \
  --agent codex

To compare Claude Code against Codex, run both agents against the same cases:

MCP_URL="$(./scripts/cluster ports | awk '/MCP:/ {print $2}')"
TRACECAT_EVAL_CLAUDE_MODEL=claude-opus-4-7 \
uv run python scripts/evals/tracecat_authoring/run_local.py \
  --mcp-url "$MCP_URL" \
  --cases smoke \
  --agents codex,claude-code

Interpretation

Read .tracecat/evals/tracecat_authoring/<timestamp>/report.md first. The report starts with a performance matrix:

Speed is wall-clock duration per agent run.
Accuracy is the fraction of structural/rubric checks passed.
MCP efficiency and reliability come from on-disk transcripts: total MCP tool calls, successful calls, failed calls, and schema/input failures.
Improvements are derived from failed rubric checks, invocation errors, and failed transcript-derived MCP checks.

Guardrails

Keep fixtures synthetic: example.com, placeholder ids, no real people, no real tokens.
Do not treat existing unit tests as live generation coverage.
Do not inspect or score unrelated dirty repo changes while running the evals.
For existing workflow edit cases, expect the agent to use get_workflow, validate-only edit_workflow, then apply the edit using the original draft_revision returned by get_workflow.

test-tracecat-prompts

Test Tracecat Prompts

Workflow

Interpretation

Guardrails

同仓库更多 Skills

同仓库更多 Skills

Test Tracecat Prompts

Workflow

Interpretation

Guardrails