Skip to main content
Jeden Skill in Manus ausführen
mit einem Klick

validate-evaluator

Guides validation of evaluators, especially LLM judges, against labeled examples. Use when evaluator quality is uncertain, judge scores seem inconsistent, or you need to check whether the evaluator is biased, noisy, or misaligned.

Sterne3
Forks0
Aktualisiert29. April 2026 um 14:29
SKILL.md
readonly