Skip to main content
Run any Skill in Manus
with one click

evaluations

// Set up comprehensive evaluations for your AI agent with LangWatch — experiments (batch testing), evaluators (scoring functions), datasets, online evaluation (production monitoring), and guardrails (real-time blocking). Supports both code (SDK) and platform (CLI) approaches. Use when the user wants to evaluate, test, benchmark, monitor, or safeguard their agent.

$ git log --oneline --stat
stars:2
forks:1
updated:April 24, 2026 at 09:38
SKILL.md
readonly