
eval-harness

// Use when you need to evaluate an LLM pipeline or AI feature systematically: sets up an eval harness with test cases, scoring rubrics, and pass/fail tracking instead of one-off manual spot-checks
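
As a rough illustration of what such a harness could look like, here is a minimal sketch. It is not taken from the skill itself: `EvalCase`, `keyword_score`, `run_evals`, `pass_rate`, and `fake_pipeline` are hypothetical names, and the substring rubric and the 0.5 threshold are assumptions chosen only to keep the example small.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class EvalCase:
    """One test case: an input prompt and the answer we expect to see."""
    name: str
    prompt: str
    expected: str


def keyword_score(output: str, expected: str) -> float:
    # Hypothetical rubric: 1.0 if the expected text appears in the output
    # (case-insensitive), otherwise 0.0. Real rubrics are usually richer.
    return 1.0 if expected.lower() in output.lower() else 0.0


def run_evals(
    pipeline: Callable[[str], str],
    cases: list[EvalCase],
    scorer: Callable[[str, str], float] = keyword_score,
    threshold: float = 0.5,  # assumed pass mark, not from the source
) -> list[dict]:
    """Run every case through the pipeline and record score and pass/fail."""
    results = []
    for case in cases:
        score = scorer(pipeline(case.prompt), case.expected)
        results.append(
            {"name": case.name, "score": score, "passed": score >= threshold}
        )
    return results


def pass_rate(results: list[dict]) -> float:
    """Fraction of cases that passed (0.0 for an empty run)."""
    return sum(r["passed"] for r in results) / len(results) if results else 0.0


def fake_pipeline(prompt: str) -> str:
    # Stand-in for a real LLM call so the sketch runs offline.
    return "Paris is the capital of France." if "France" in prompt else "I don't know."


cases = [
    EvalCase("capital-fr", "What is the capital of France?", "Paris"),
    EvalCase("capital-de", "What is the capital of Germany?", "Berlin"),
]
results = run_evals(fake_pipeline, cases)
for r in results:
    print(r["name"], "PASS" if r["passed"] else "FAIL", r["score"])
print("pass rate:", pass_rate(results))
```

Storing the per-case results from each run makes it possible to compare pass rates across pipeline changes, which is the main advantage over manual spot-checks.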

$ git log --oneline --stat
stars:34
forks:10
updated: May 29, 2026 at 07:11
SKILL.md
readonly