
llm-eval-type-selector

// Use this skill when a developer wants to decide what type of evaluation to build for their AI system. Triggers on: "should I use a rule or a judge", "what type of eval should I build", "decide eval type", "judge vs programmatic rule", "LLM-as-judge vs rule-based eval", "which evaluation type should I use", "how do I evaluate [X]", "what eval should I use for this failure", "is this a rule or a judge", "how should I evaluate my AI automatically", "what kind of eval fits this issue". Takes one or more failure modes or quality dimensions and returns a concrete type recommendation (programmatic rule, LLM-as-judge, or composite) with rationale and a suggested implementation path.

stars:14
forks:2
updated:April 23, 2026 at 14:31
SKILL.md