llm-eval

// LLM output evaluation pipeline: audit evals, failure analysis, synthetic data, LLM-as-Judge, RAG eval, annotation design. Triggers on: llm eval, evaluate ai, eval pipeline, judge calibration, rag eval, ai quality, /llm-eval.

stars: 0
forks: 1
updated: April 18, 2026 at 17:40
File Explorer
2 files
SKILL.md