Skip to main content
Run any Skill in Manus
with one click
$pwd:

llm-evaluation

// Implement comprehensive evaluation strategies for LLM applications using automated metrics, human feedback, and benchmarking. Use when testing LLM performance, measuring AI application quality, or establishing evaluation frameworks.

$ git log --oneline --stat
stars:1
forks:0
updated:May 17, 2026 at 11:10
SKILL.md
readonly