Skip to main content
Manusで任意のスキルを実行
ワンクリックで

validate-evaluator

Guides validation of evaluators, especially LLM judges, against labeled examples. Use when evaluator quality is uncertain, judge scores seem inconsistent, or you need to check whether the evaluator is biased, noisy, or misaligned.

スター3
フォーク0
更新日2026年4月29日 14:29
SKILL.md
readonly