Skip to main content
在 Manus 中运行任何 Skill
一键导入

evaluations

// Set up comprehensive evaluations for your AI agent with LangWatch — experiments (batch testing), evaluators (scoring functions), datasets, online evaluation (production monitoring), and guardrails (real-time blocking). Supports both code (SDK) and platform (CLI) approaches. Use when the user wants to evaluate, test, benchmark, monitor, or safeguard their agent.

$ git log --oneline --stat
stars:2
forks:1
updated:2026年4月24日 09:38
SKILL.md
readonly