Skip to main content
تشغيل أي مهارة في Manus
بنقرة واحدة

eval-harness

// Use when you need to evaluate an LLM pipeline or AI feature systematically — sets up an eval harness with test cases, scoring rubrics, and pass/fail tracking rather than one-off manual spot-checks

$ git log --oneline --stat
stars:٣٤
forks:١٠
updated:٢٩ مايو ٢٠٢٦ في ٠٧:١١
SKILL.md
readonly