Create flexible annotation workflows for AI applications. Contains common tools for exploring raw AI agent logs and transcripts, extracting relevant evaluation data, and building LLM-as-a-judge evaluators.

2025-10-28

#002

skills

1 skill, updated 2026-04-10

Accounts for 50% of this creator's total

Skill: agent-trace-investigator

Occupation category: Software Quality Assurance Analysts and Testers

Description: Investigates agent behaviors and performance from any observability store: local logs, agent trajectory stores, and MCP servers from observability products. Covers how models reason, interact, handle pressure, mirror values, and coordinate. Guides tracing setup. Use when: (1) analyzing agent failures or slowness, (2) auditing reasoning quality, (3) comparing behavioral patterns across models, (4) understanding multi-agent dynamics, (5) setting up tracing, (6) debugging reports of model performance degradation. Triggers: "investigate traces", "analyze agent behavior", "debug the agent", "model behavior", "set up tracing".

Updated: 2026-04-10

Showing 2 / 2 repositories
