
sc-evaluate

Overview

LLM pipeline evaluation with oracle judge scoring. Runs prompts against gold standard datasets, evaluates output quality via LLM-as-judge, and generates scored reports with improvement recommendations.
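
The flow described above (run prompts from a gold dataset, score each output with a judge, aggregate into a report) can be illustrated with a minimal sketch. This is not the skill's actual implementation: `GoldExample`, `stub_pipeline`, `stub_judge`, `evaluate`, and the 0.5 threshold are all hypothetical names and values chosen for illustration, and the judge here is a plain string comparison standing in for a real LLM call.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Callable


@dataclass
class GoldExample:
    """One gold-standard item: a prompt and its reference answer (hypothetical schema)."""
    prompt: str
    expected: str


def stub_pipeline(prompt: str) -> str:
    """Hypothetical stand-in for the LLM pipeline under test."""
    canned = {"2 + 2 = ?": "4", "Capital of France?": "Lyon"}
    return canned.get(prompt, "")


def stub_judge(prompt: str, expected: str, actual: str) -> float:
    """Hypothetical stand-in for an LLM judge.

    Returns 1.0 on a case-insensitive exact match and 0.0 otherwise.
    A real judge would call a model and parse a rubric score instead.
    """
    return 1.0 if actual.strip().lower() == expected.strip().lower() else 0.0


def evaluate(
    dataset: list[GoldExample],
    pipeline: Callable[[str], str],
    judge: Callable[[str, str, str], float],
    threshold: float = 0.5,  # assumed cutoff below which an item counts as a failure
) -> dict:
    """Run every gold prompt through the pipeline, score it, and build a report."""
    rows = []
    for example in dataset:
        actual = pipeline(example.prompt)
        score = judge(example.prompt, example.expected, actual)
        rows.append(
            {
                "prompt": example.prompt,
                "expected": example.expected,
                "actual": actual,
                "score": score,
            }
        )
    failures = [row for row in rows if row["score"] < threshold]
    return {
        "mean_score": mean(row["score"] for row in rows),
        "rows": rows,
        "failures": failures,  # candidates for improvement recommendations
    }


GOLD = [
    GoldExample("2 + 2 = ?", "4"),
    GoldExample("Capital of France?", "Paris"),
]

report = evaluate(GOLD, stub_pipeline, stub_judge)
print(f"mean score: {report['mean_score']:.2f}")
for row in report["failures"]:
    print(f"FAILED: {row['prompt']!r} expected {row['expected']!r}, got {row['actual']!r}")
```

Passing the pipeline and judge in as callables keeps the loop independent of any particular model provider, so the stubs can be swapped for real calls without touching `evaluate`. The sketch assumes a non-empty dataset, since `statistics.mean` raises on empty input.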

Install command
npx skills add https://github.com/Tony363/SuperClaude --skill sc-evaluate

Copy and paste this command into Claude Code to install the skill.

Stars: 18
Forks: 2
Updated: March 2, 2026, 00:23
SKILL.md