agentbench-eval

// AgentBench evaluation harness for claudemem. Covers pre-indexed repos, experiment conditions, running benchmarks, analyzing results, and managing index archives. Use when working on eval infrastructure, running experiments, or interpreting benchmark results.

stars: 39
forks: 6
updated: March 4, 2026 at 10:24
SKILL.md