Skip to main content
Run any Skill in Manus
with one click
$pwd:

advanced-evaluation

// This skill should be used for advanced LLM evaluation: LLM-as-judge systems, direct scoring, pairwise comparison, rubric calibration, evaluator bias mitigation, confidence scoring, and automated quality assessment.

$ git log --oneline --stat
stars:15,902
forks:1,286
updated:May 19, 2026 at 06:08
File Explorer
6 files
SKILL.md
readonly