Skip to main content
Run any Skill in Manus
with one click

agent-evaluation-framework

// Workflow for evaluating and refining agent debugging capabilities using designated test cases and Swarm principles. Use when evaluating subagent performance or creating benchmarks. Do not use for regular bug fixing.

$ git log --oneline --stat
stars:25,042
forks:4,274
updated:May 12, 2026 at 10:16
SKILL.md
readonly