Skip to main content
在 Manus 中运行任何 Skill
一键导入

llm-judge

// AI quality judge that scores agent responses 0-10 across helpfulness, accuracy, completeness, and clarity. Use when evaluating multi-agent output or implementing LLM-as-judge quality gates.

$ git log --oneline --stat
stars:3,771
forks:759
updated:2026年5月5日 19:35
SKILL.md
readonly