Skip to main content
Jeden Skill in Manus ausführen
mit einem Klick

agent-evaluation-framework

// Workflow for evaluating and refining agent debugging capabilities using designated test cases and Swarm principles. Use when evaluating subagent performance or creating benchmarks. Do not use for regular bug fixing.

$ git log --oneline --stat
stars:25.042
forks:4.274
updated:12. Mai 2026 um 10:16
SKILL.md
readonly