Run any Skill in Manus with one click

error-handling

Stars8

Forks1

UpdatedMarch 21, 2026 at 20:01

Debug and recover from agent team errors including common errors, hooks for quality gates, known limitations, and recovery strategies. Use when encountering team errors, enforcing quality gates with hooks, understanding limitations, or debugging agent issues.

Installation

Install with Codex or Claude Copy this prompt, paste it into Codex, Claude, or another assistant, and let it review the skill page and install it for you.

Run Skill in Manus

Source

zircote

zircote/claude-team-orchestration

View GitHub Repository View Creator Repositories

Download

Run Skill in Manus

Related occupationsSOC

Based on SOC occupation classification

Software DevelopersComputer and Mathematical Occupations·SOC 15-1252

File Explorer

3 files

SKILL.md

readonly

More from this repository

same repository

agent-types

zircote/claude-team-orchestration

Choose the right agent type for each task including built-in agents (Bash, Explore, Plan, general-purpose) and plugin agents (review, research, refactoring, SDLC). Use when selecting agent types, understanding agent capabilities, or matching agents to tasks.

2026-03-218

messaging

zircote/claude-team-orchestration

Send messages between agents using SendMessage including direct messages, broadcasts, shutdown requests/responses, and plan approvals. Use when communicating between agents, understanding message formats, or handling structured protocol messages.

2026-03-218

orchestrating

zircote/claude-team-orchestration

Master multi-agent orchestration using Claude Code's agent teams and task system. Use when coordinating multiple agents, running parallel code reviews, creating pipeline workflows with dependencies, building self-organizing task queues, or any task benefiting from divide-and-conquer patterns. Routes to specialized sub-skills for team management, tasks, messaging, patterns, backends, and error handling.

2026-03-218

orchestration-patterns

zircote/claude-team-orchestration

Apply proven orchestration patterns for agent teams including parallel specialists, pipelines, swarms, research+implementation, plan approval, and multi-file refactoring. Use when choosing a team structure, designing workflows, or implementing specific coordination patterns.

2026-03-218

task-system

zircote/claude-team-orchestration

Manage shared task lists for agent teams including creating tasks, setting dependencies, claiming work, and tracking progress. Use when creating work items, building task pipelines, coordinating task ownership, or managing task dependencies.

2026-03-218

jsonl-log-analyzer

zircote/claude-team-orchestration

Analyze large JSONL log files using schema-aware partitioned analysis. Discovers field schema, generates tailored jq extraction recipes, and orchestrates parallel chunk analysts with synthesis. Use when processing JSONL logs exceeding context limits, performing log analytics, or investigating incident logs.

2026-03-198

name	error-handling
description	Debug and recover from agent team errors including common errors, hooks for quality gates, known limitations, and recovery strategies. Use when encountering team errors, enforcing quality gates with hooks, understanding limitations, or debugging agent issues.

Error Handling

Experimental: Agent teams are disabled by default. Enable with CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS in your settings.json or environment.

Debug, recover from, and prevent common agent team errors. Includes hooks for quality enforcement and known limitations.

Related skills:

Orchestrating - Primitives overview and quick reference
Team Management - Shutdown and cleanup procedures
Task System - Task status issues
Messaging - Message debugging
Spawn Backends - Backend troubleshooting

Common Errors

Error	Cause	Solution
"Cannot cleanup with active members"	Teammates still running	Shutdown all teammates first, wait for approval
"Already leading a team"	Team already exists	`TeamDelete()` first, or use different team name
"Agent not found"	Wrong teammate name	Read `config.json` for actual names
"Team does not exist"	No team created	Call `TeamCreate()` first
"team_name is required"	Missing team context	Provide `team_name` parameter
"Agent type not found"	Invalid subagent_type	Check available agents with proper prefix

Quality Gate Hooks

Use hooks to enforce rules when teammates finish work or tasks complete.

TeammateIdle Hook

Runs when a teammate is about to go idle. Exit with code 2 to send feedback and keep the teammate working.

{
  "hooks": {
    "TeammateIdle": [
      {
        "matcher": "",
        "hooks": [
          {
            "type": "command",
            "command": "python3 check_teammate_quality.py"
          }
        ]
      }
    ]
  }
}

Use cases:

Verify teammate completed all assigned tasks before going idle
Run linting or tests on teammate's changes
Enforce documentation requirements

Exit codes:

0 - Allow teammate to go idle normally
2 - Send feedback to teammate, keep them working

TaskCompleted Hook

Runs when a task is being marked complete. Exit with code 2 to prevent completion and send feedback.

{
  "hooks": {
    "TaskCompleted": [
      {
        "matcher": "",
        "hooks": [
          {
            "type": "command",
            "command": "python3 validate_task_completion.py"
          }
        ]
      }
    ]
  }
}

Use cases:

Verify tests pass before marking a task complete
Ensure code quality standards are met
Validate documentation was updated

Exit codes:

0 - Allow task completion
2 - Prevent completion, send feedback to teammate

Known Limitations

Agent teams are experimental. Current limitations:

No session resumption with in-process teammates: /resume and /rewind do not restore in-process teammates. After resuming, the lead may try to message teammates that no longer exist. Tell the lead to spawn new teammates.
Task status can lag: Teammates sometimes fail to mark tasks as completed, which blocks dependent tasks. Check whether work is done and update status manually, or tell the lead to nudge the teammate.
Shutdown can be slow: Teammates finish their current request or tool call before shutting down.
One team per session: A lead can only manage one team at a time. Clean up the current team before starting a new one.
No nested teams: Teammates cannot spawn their own teams or teammates. Only the lead can manage the team.
Lead is fixed: The session that creates the team is the lead for its lifetime. You cannot promote a teammate or transfer leadership.
Permissions set at spawn: All teammates start with the lead's permission mode. You can change individual modes after spawning, but cannot set per-teammate modes at spawn time.
Split panes require tmux or iTerm2: Default in-process mode works in any terminal. Split-pane mode isn't supported in VS Code's integrated terminal, Windows Terminal, or Ghostty.

Graceful Shutdown Sequence

See Team Management for the full shutdown procedure. In summary:

// 1. Request shutdown for all teammates
SendMessage({ to: "worker-1", message: { type: "shutdown_request", reason: "Done" } })
SendMessage({ to: "worker-2", message: { type: "shutdown_request", reason: "Done" } })

// 2. Wait for shutdown approvals

// 3. Verify no active members

// 4. Only then cleanup
TeamDelete()

Handling Crashed Teammates

Teammates have a 5-minute heartbeat timeout. If a teammate crashes:

They are automatically marked as inactive after timeout
Their tasks remain in the task list
Another teammate can claim their tasks
Cleanup will work after timeout expires

Recovery Strategies

Teammate Stops on Error

Teammates may stop after encountering errors instead of recovering.

Recovery:

Check their output using Shift+Up/Down (in-process) or click pane (split mode)
Give them additional instructions directly
Or spawn a replacement teammate to continue the work

Lead Starts Implementing Instead of Delegating

The lead sometimes starts doing work itself instead of waiting for teammates.

Recovery: Tell it to wait:

Wait for your teammates to complete their tasks before proceeding

Or enable delegate mode to restrict the lead to coordination-only tools.

Lead Shuts Down Prematurely

The lead may decide the team is finished before all tasks are complete.

Recovery: Tell it to keep going. You can also tell the lead to wait for teammates to finish before proceeding.

Task Appears Stuck

A task stays in pending even though its dependencies are done.

Recovery:

Check if the blocking task was actually marked completed
If work is done but status wasn't updated, update it manually
Tell the lead to nudge the teammate

Too Many Permission Prompts

Teammate permission requests bubble up to the lead.

Recovery: Pre-approve common operations in your permission settings before spawning teammates.

Orphaned tmux Sessions

A tmux session persists after the team ends.

Recovery:

tmux ls
tmux kill-session -t <session-name>

Debugging Commands

# Check team config
cat ~/.claude/teams/{team}/config.json | jq '.members[] | {name, agentType, backendType}'

# Check teammate inboxes
cat ~/.claude/teams/{team}/inboxes/{agent}.json | jq '.'

# List all teams
ls ~/.claude/teams/

# Check task states
cat ~/.claude/tasks/{team}/*.json | jq '{id, subject, status, owner, blockedBy}'

# Watch for new messages
tail -f ~/.claude/teams/{team}/inboxes/team-lead.json

Best Practices for Error Prevention

Build retry logic into worker prompts — crashed workers have a 5-minute heartbeat timeout, after which their tasks can be reclaimed
Avoid file conflicts — break work so each teammate owns a different set of files
Monitor and steer — check progress, redirect failing approaches, and synthesize findings as they arrive