Skip to main content
تشغيل أي مهارة في Manus
بنقرة واحدة

llm-eval-type-selector

// Use this skill when a developer wants to decide what type of evaluation to build for their AI system. Triggers on: "should I use a rule or a judge", "what type of eval should I build", "decide eval type", "judge vs programmatic rule", "LLM-as-judge vs rule-based eval", "which evaluation type should I use", "how do I evaluate [X]", "what eval should I use for this failure", "is this a rule or a judge", "how should I evaluate my AI automatically", "what kind of eval fits this issue". Takes one or more failure modes or quality dimensions and returns a concrete type recommendation — programmatic rule, LLM-as-judge, or composite — with rationale and a suggested implementation path.

$ git log --oneline --stat
stars:١٤
forks:٢
updated:٢٣ أبريل ٢٠٢٦ في ١٤:٣١
SKILL.md
readonly