sc-evaluate

LLM pipeline evaluation with oracle judge scoring. Runs prompts against gold standard datasets, evaluates output quality via LLM-as-judge, and generates scored reports with improvement recommendations.
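The flow described above (run prompts against a gold dataset, score each output with a judge, aggregate into a report) can be sketched roughly as follows. This is a minimal illustration, not the skill's actual implementation: `GoldExample`, `overlap_judge`, `evaluate`, and the `threshold` parameter are hypothetical names, and the judge here is a simple token-overlap stand-in so the example runs offline. In practice the judge would presumably be a call to an LLM.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Callable


@dataclass
class GoldExample:
    """One gold-standard item: a prompt and its reference answer (hypothetical schema)."""
    prompt: str
    expected: str


def overlap_judge(expected: str, actual: str) -> float:
    """Stand-in for an LLM judge: fraction of expected tokens present in the output."""
    expected_tokens = set(expected.lower().split())
    actual_tokens = set(actual.lower().split())
    if not expected_tokens:
        return 0.0
    return len(expected_tokens & actual_tokens) / len(expected_tokens)


def evaluate(
    pipeline: Callable[[str], str],
    dataset: list[GoldExample],
    judge: Callable[[str, str], float],
    threshold: float = 0.7,  # assumed cut-off for flagging weak outputs
) -> dict:
    """Run every prompt through the pipeline, score it, and build a small report."""
    if not dataset:
        raise ValueError("dataset must contain at least one example")
    rows = []
    for example in dataset:
        output = pipeline(example.prompt)
        score = judge(example.expected, output)
        rows.append({"prompt": example.prompt, "output": output, "score": score})
    return {
        "mean_score": mean(row["score"] for row in rows),
        "rows": rows,
        # Prompts scoring below the threshold are candidates for improvement.
        "needs_review": [row["prompt"] for row in rows if row["score"] < threshold],
    }


# Toy pipeline that answers one prompt correctly and one incorrectly.
canned_answers = {"capital of France?": "Paris", "2 + 2?": "5"}
dataset = [GoldExample("capital of France?", "Paris"), GoldExample("2 + 2?", "4")]
report = evaluate(canned_answers.__getitem__, dataset, overlap_judge)
```

Taking the judge as a plain callable means the scoring logic can be swapped for a real model call without changing the evaluation loop or the report shape.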

Install command
npx skills add https://github.com/Tony363/SuperClaude --skill sc-evaluate

Copy and paste this command into Claude Code to install the skill

Stars: 18
Forks: 2
Updated: March 2, 2026 at 00:23
SKILL.md