Skip to main content
在 Manus 中运行任何 Skill
一键导入

sc-evaluate

LLM pipeline evaluation with oracle judge scoring. Runs prompts against gold standard datasets, evaluates output quality via LLM-as-judge, and generates scored reports with improvement recommendations.

概览

LLM pipeline evaluation with oracle judge scoring. Runs prompts against gold standard datasets, evaluates output quality via LLM-as-judge, and generates scored reports with improvement recommendations.

安装命令
npx skills add https://github.com/Tony363/SuperClaude --skill sc-evaluate

复制此命令并粘贴到 Claude Code 中以安装该技能

星标18
分支2
更新时间2026年3月2日 00:23
SKILL.md
readonly