Skip to main content
在 Manus 中运行任何 Skill
一键导入
$pwd:

agent-evaluation-mlflow

// Implement agent evaluation and safety gates using MLflow 3.x. Use for creating LLM-as-Judge scorers, evaluation datasets, quality gates, tracing, and continuous evaluation. Triggers on "evaluate agent", "MLflow scorer", "LLM judge", "safety evaluation", "quality gate", "agent testing", "hallucination detection", or when implementing spec/010-agent-evaluation.md requirements.

$ git log --oneline --stat
stars:2
forks:0
updated:2025年12月19日 04:22
文件资源管理器
2 个文件
SKILL.md
readonly