Skip to main content
تشغيل أي مهارة في Manus
بنقرة واحدة

eval-audit

// Audit an LLM eval pipeline and surface problems: missing error analysis, unvalidated judges, vanity metrics, etc. Use when inheriting an eval system, when unsure whether evals are trustworthy, or as a starting point when no eval infrastructure exists. Do NOT use when the goal is to build a new evaluator from scratch (use error-analysis, write-judge-prompt, or validate-evaluator instead).

$ git log --oneline --stat
stars:١٬٣٣٣
forks:١٣٨
updated:٣ مارس ٢٠٢٦ في ٠٢:٥٣
SKILL.md
readonly