一键在 Manus 中运行任何 Skill

$pwd:

investigate-ai-session

Name: Investigate Ai Session
Author: amplitude

// Deep-dives into specific AI agent sessions or failure patterns to explain why something went wrong. Only use when the user has Amplitude Agent Analytics instrumented in their project. Use when investigating a specific session ID, debugging agent failures, understanding why quality is low, tracing tool errors, or when monitor-ai-quality surfaces an issue that needs root cause analysis.

在 Manus 中运行

$ git log --oneline --stat

stars:26

forks:6

updated:2026年4月1日 19:27

SKILL.md

readonly

related-skills.json

同仓库

taxonomy.md

from "amplitude/mcp-marketplace"

Source of truth for event taxonomy generation, data auditing, and governance best practices in Amplitude. Use when an agent needs to create, validate, audit, score, or recommend improvements to event tracking plans, naming conventions, property standards, data quality, or deprecation workflows. Covers naming rules, property standards, scoring frameworks, safe metadata operations, deprecation procedures, and AI readiness guidance.

2026-05-0126

analyze-ai-topics.md

from "amplitude/mcp-marketplace"

Analyzes what users ask AI agents about and how well each topic is served. Only use when the user has Amplitude Agent Analytics instrumented in their project. Use when the user asks "what are people asking the AI", "top AI topics", "where is the AI struggling", "AI coverage gaps", "what should we improve in our AI", or wants product insights from AI conversation patterns.

2026-04-0126

monitor-ai-quality.md

from "amplitude/mcp-marketplace"

Monitors AI agent health across quality, cost, performance, and errors. Only use when the user has Amplitude Agent Analytics instrumented in their project. Use when the user asks "how are our AI agents doing", "AI quality check", "agent health", "AI errors", "agent performance", "LLM cost", or wants a proactive health report on their AI/LLM features.

2026-04-0126

review-agent-insights.md

from "amplitude/mcp-marketplace"

Retrieves, synthesizes, and prioritizes all recent AI agent results from Amplitude. Queries every agent type available in get_agent_results, validates freshness, and produces a unified narrative ranked by impact. Use when the user asks "what has the AI found", "show me agent insights", "any AI findings", "what did Amplitude discover", "review AI insights", or wants a digest of everything Amplitude's AI agents have surfaced recently.

2026-04-0126

add-analytics-instrumentation.md

from "amplitude/mcp-marketplace"

End-to-end analytics instrumentation workflow for a PR, branch, file, directory, or feature. Reads the code, discovers what events should be tracked, and produces a concrete instrumentation plan — all in one shot. Use this skill whenever a user wants to add analytics to a PR, asks "instrument this PR", "add tracking to this branch", "what analytics does this file need", "instrument the checkout flow", "run the full instrumentation workflow", or any request that implies going from code changes to a tracking plan. Also trigger when the user gives you a PR link, branch name, file path, or feature description and mentions analytics, events, or instrumentation. This is the main entry point for the analytics workflow — prefer it over calling the individual steps (diff-intake, discover-event-surfaces, instrument-events) separately.

2026-03-2626

diff-intake.md

from "amplitude/mcp-marketplace"

Reads a PR or branch diff and produces a structured YAML change brief for downstream analytics instrumentation skills. Use this as the first step whenever a user shares a PR link, branch comparison, or raw diff and wants to understand what changed, what needs tracking, or how to instrument a feature. Trigger on phrases like "review this PR", "what changed in this branch", "help me instrument this diff", "check analytics coverage for this change", or any request to start the analytics review workflow.

2026-03-2626

package.json

"author": "amplitude"

"repository": "amplitude/mcp-marketplace"

打开 GitHub 仓库查看创作者相关仓库

$ install --global

$ download --local

在 Manus 中运行

$ useful --forSOC

计算机系统分析师计算机与数学类职业15-1211L4

name	investigate-ai-session
description	Deep-dives into specific AI agent sessions or failure patterns to explain why something went wrong. Only use when the user has Amplitude Agent Analytics instrumented in their project. Use when investigating a specific session ID, debugging agent failures, understanding why quality is low, tracing tool errors, or when monitor-ai-quality surfaces an issue that needs root cause analysis.

AI Session Investigator

You investigate specific AI agent sessions or failure patterns to determine root causes. You operate at the session and span level — reading conversations, tracing execution, and connecting failures to their origins. This is the "why" skill that follows the "what" from /monitor-ai-quality.

Instructions

Step 1: Determine Investigation Scope

The user will provide one of:

A specific session ID → go directly to Step 2
A failure pattern (e.g., "Chart Agent timeouts", "tool errors in the last day") → go to Step 1b
A user complaint (e.g., "user X said the agent didn't work") → go to Step 1c
A vague signal (e.g., "something's off with the agents") → redirect to /monitor-ai-quality first, then come back with specific findings

Step 1b: Find Sessions Matching a Pattern

Call Amplitude:get_agent_analytics_schema with include: ["filter_options"] to discover valid agent names, tool names, and topic values. Then call Amplitude:query_agent_analytics_sessions with appropriate filters:

Agent failures: agentNames: ["<agent>"], hasTaskFailure: true
Tool errors: toolNames: ["<tool>"], hasTaskFailure: true
Technical failures: hasTechnicalFailure: true
Low quality: maxQualityScore: 0.4
Frustrated users: maxSentimentScore: 0.4 or hasNegativeFeedback: true
Expensive sessions: minCostUsd: <threshold>
Slow sessions: minDurationMs: <threshold>
Specific topic: primaryTopics: ["<topic>"] or use topicClassifications for model-specific filtering

Use responseFormat: "concise", limit: 20, and sort by "-session_start" to get recent examples. Select the 3-5 most representative sessions for deep investigation.

Step 1c: Find a Specific User's Sessions

Call Amplitude:query_agent_analytics_sessions with searchQuery: "<email or user ID>" to find their sessions. If they reported a specific timeframe, add startDate/endDate. Pick the session(s) that match the complaint.

Step 2: Deep-Dive into Sessions (Budget: 3-6 calls)

For each session being investigated (max 3-5 sessions), run these in parallel per session:

Full session detail. Call Amplitude:query_agent_analytics_sessions with sessionIds: ["<id>"], responseFormat: "detailed". This returns enrichment data: rubric scores, failure reasons, topic classifications, overall outcome, and quality flags.
Conversation transcript. Call Amplitude:get_agent_analytics_conversation with sessionId: "<id>", includeCategories: true. Read the full user-agent exchange to understand what was asked, how the agent responded, and where things broke down.
Execution trace. Call Amplitude:query_agent_analytics_spans with sessionId: "<id>". This shows every LLM call, tool call, and embedding operation — their latency, status, cost, and ordering. Look for:
- Spans with status: "ERROR" — direct failures
- Tool calls with high latency (>10s) — timeouts or slow dependencies
- Multiple retries of the same tool — agent struggling
- LLM calls with unusually high token counts — potential prompt bloat
- The sequence of operations — did the agent take a reasonable path?

Step 3: Root Cause Analysis

With conversation + trace + enrichment data, build the diagnosis:

Classify the failure type:
- Tool failure: A tool call returned an error or timed out. Check the span's status and error details. Was it the right tool? Did the agent pass valid inputs?
- LLM failure: The model produced a bad response — hallucination, refusal, wrong format, or infinite loop. Check the conversation for where the response diverged.
- Orchestration failure: The agent chose the wrong tools, called them in the wrong order, or gave up too early. Trace the span sequence.
- User confusion: The user's request was ambiguous or impossible. The agent failed to clarify. Check the first 1-2 turns.
- Data/context issue: The agent had insufficient context — missing schema, wrong project, stale data. Check what context was available.
Determine scope: Is this a one-off or systemic?
- If investigating a pattern (Step 1b), check: Do all failing sessions share the same failure type, tool, or agent? Use Amplitude:query_agent_analytics_sessions with groupBy: ["agent_name"] or groupBy: ["primary_topic"] to see if failures cluster.
- If a single session, call Amplitude:query_agent_analytics_sessions with the same agent and time window to check if similar failures exist.
Find the trigger: What changed?
- Check if failures started on a specific date (new deployment, model change, config update)
- Check if failures correlate with specific topics or user segments
- Check if a tool's error rate changed using Amplitude:query_agent_analytics_spans with groupBy: ["tool_name"]

Step 4: Search for Related Patterns (Budget: 1-2 calls)

If the root cause isn't clear from the session data alone:

Search conversations. Call Amplitude:search_agent_analytics_conversations with keywords from the error or topic to find other sessions with the same issue. This surfaces patterns the session-level queries might miss.
Check tool/model health. Call Amplitude:query_agent_analytics_spans with groupBy: ["tool_name"] or groupBy: ["model_name"] over the relevant time window. Look for tools with elevated error rates or latency that correlate with the failing sessions.

Step 5: Present the Investigation

Structure the output as a root cause analysis.

Required sections:

Investigation summary (2-3 sentences): What was investigated, what was found, and the severity. Written as a headline for the team.
Sessions examined: A compact table of the sessions investigated:

| Session ID | Agent | Outcome | Quality | Sentiment | Failure Type |
|------------|-------|---------|---------|-----------|--------------|
| [id] | [name] | [outcome] | [score] | [score] | [type or —] |

Root cause (1 paragraph): The primary explanation for what went wrong. Be specific — name the tool, the error, the model behavior, or the orchestration issue. Include evidence from the conversation and trace.
Execution trace highlights (for the most illustrative session): Walk through the key spans showing the failure path:
- "Turn 1: User asked X → Agent called tool Y (OK, 2.1s) → Agent called tool Z (ERROR, timeout after 30s) → Agent responded with fallback that didn't address the question"
- Focus on the failure point and what led to it
Conversation excerpt (if revealing): Quote the 2-3 most relevant turns showing where the agent failed the user. Keep it brief.
Scope assessment: One-off vs. systemic. How many sessions are affected? Is it getting worse?
Recommended fixes (2-4 numbered items): Concrete actions. Examples:
- "Add a retry with exponential backoff for the query_dataset tool — 8 of 15 failures are transient timeouts"
- "The agent is calling get_events before get_context, causing a missing project ID error — fix the tool ordering in the agent prompt"
- "Users asking about retention are getting routed to the Chart Agent instead of the Funnel Agent — update the routing logic"
Follow-on prompt: Offer next steps — "Want me to check if this tool timeout affects other agents, search for similar user complaints, or monitor this pattern over the next few days?"

Examples

Example 1: Specific Session Investigation

User says: "What happened in session abc-123?"

Actions:

Get detailed session data, conversation, and spans for abc-123 (3 parallel calls)
Read the conversation to understand what the user wanted
Trace the spans to find where the execution failed
Classify the failure and check if it's systemic
Present root cause with trace highlights and conversation excerpt

Example 2: Pattern Investigation

User says: "Why are Chart Agent sessions failing?"

Actions:

Get AI schema to confirm "Chart Agent" is a valid agent name
Query recent Chart Agent failures (hasTaskFailure: true, agentNames: ["Chart Agent"])
Pick the 3 most recent failures and deep-dive into each
Compare the failures — same tool? Same error? Same topic?
Check tool health with span aggregations
Present the pattern with root cause and scope assessment

Example 3: User Complaint

User says: "A customer said our AI gave them wrong data yesterday"

Actions:

Ask for the customer's email or user ID
Search for their sessions from yesterday
Deep-dive into the relevant session(s)
Read the conversation to find what data was wrong
Trace the spans to see what tools provided the data
Present findings with the specific conversation excerpt showing the error

Troubleshooting

Session ID not found

The session may be from a different project, or outside the data retention window. Ask the user to confirm the project and check if the session ID is correct.

Spans not available for a session

Span-level data requires OpenTelemetry-compatible tracing in the AI agent. Report what's available from the session and conversation level and note that span data would help narrow the root cause.

Too many failing sessions to investigate

Don't try to investigate more than 5 sessions in detail. Instead, use groupBy on query_agent_analytics_sessions to find the common pattern, then deep-dive into 2-3 representative examples.

investigate-ai-session

同仓库更多 Skills

同仓库更多 Skills

AI Session Investigator

Instructions

Step 1: Determine Investigation Scope

Step 1b: Find Sessions Matching a Pattern

Step 1c: Find a Specific User's Sessions

Step 2: Deep-Dive into Sessions (Budget: 3-6 calls)

Step 3: Root Cause Analysis

Step 4: Search for Related Patterns (Budget: 1-2 calls)

Step 5: Present the Investigation

Examples

Example 1: Specific Session Investigation

Example 2: Pattern Investigation

Example 3: User Complaint

Troubleshooting

Session ID not found

Spans not available for a session

Too many failing sessions to investigate

AI Session Investigator

Instructions

Step 1: Determine Investigation Scope

Step 1b: Find Sessions Matching a Pattern

Step 1c: Find a Specific User's Sessions

Step 2: Deep-Dive into Sessions (Budget: 3-6 calls)

Step 3: Root Cause Analysis

Step 4: Search for Related Patterns (Budget: 1-2 calls)

Step 5: Present the Investigation

Examples

Example 1: Specific Session Investigation

Example 2: Pattern Investigation

Example 3: User Complaint

Troubleshooting

Session ID not found

Spans not available for a session

Too many failing sessions to investigate