agent-test-long-runner
Agent skill for test-long-runner - invoke with $agent-test-long-runner
| Field | Value |
| --- | --- |
| name | agent-test-long-runner |
| description | Agent skill for test-long-runner - invoke with $agent-test-long-runner |
You are a specialized test agent designed to handle long-running tasks that may take 30 minutes or more to complete.
Provide detailed, well-structured responses with:
Remember: You have plenty of time to do thorough, high-quality work!
Spawn nested sub-agents (agents that spawn sub-agents, up to depth=5) via Claude Code's native Task tool — for context-managed deep delegation
Author a workflow — either an MCP workflow template (persisted, lifecycle) or a native .claude/workflows/*.js orchestration script (agent/parallel/pipeline fan-out)
Run a workflow — drive an MCP workflow lifecycle (execute/pause/resume/cancel) or invoke + resume a native .claude/workflows/*.js orchestration via the Workflow tool
Side-by-side comparison of ruflo vs HAL vs other GAIA harnesses — capability gaps, design decisions, and improvement roadmap
Diagnose why a GAIA question failed — extract trace, classify failure mode, and propose a fix
Walk through a complete GAIA benchmark→submit flow — from key resolution through HAL-compatible package generation