
validate-evaluator

Guides validation of evaluators, especially LLM judges, against labeled examples. Use when evaluator quality is uncertain, judge scores seem inconsistent, or you need to check whether the evaluator is biased, noisy, or misaligned.
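As a rough illustration of checking a judge against labeled examples, the sketch below compares a judge's pass/fail verdicts with human labels and reports overall agreement plus the true positive and true negative rates. It assumes binary verdicts; the names `JudgeReport` and `compare_judge_to_labels` are hypothetical and are not taken from the skill itself.

```python
from dataclasses import dataclass


@dataclass
class JudgeReport:
    agreement: float            # share of examples where judge and human agree
    true_positive_rate: float   # share of human "pass" items the judge also passes
    true_negative_rate: float   # share of human "fail" items the judge also fails


def compare_judge_to_labels(human: list[bool], judge: list[bool]) -> JudgeReport:
    """Compare judge verdicts with human labels (True = pass, False = fail)."""
    if len(human) != len(judge) or not human:
        raise ValueError("human and judge must be non-empty and of equal length")

    pairs = list(zip(human, judge))
    tp = sum(1 for h, j in pairs if h and j)
    tn = sum(1 for h, j in pairs if not h and not j)
    positives = sum(1 for h in human if h)
    negatives = len(human) - positives

    return JudgeReport(
        agreement=(tp + tn) / len(human),
        # NaN signals that the labeled set has no examples of that class.
        true_positive_rate=tp / positives if positives else float("nan"),
        true_negative_rate=tn / negatives if negatives else float("nan"),
    )


if __name__ == "__main__":
    human_labels = [True, True, True, False, False]
    judge_verdicts = [True, True, False, False, True]
    print(compare_judge_to_labels(human_labels, judge_verdicts))
```

Reporting the two rates separately, rather than agreement alone, makes a one-sided bias visible: a judge that passes almost everything can still show high agreement when most labeled examples are passes.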

Stars: 3
Forks: 0
Updated: April 29, 2026 at 14:29
SKILL.md