Skip to main content
Execute qualquer Skill no Manus
com um clique
$pwd:

evaluation

// This skill should be used when building agent evaluation systems: deterministic checks, regression suites, multi-dimensional rubrics, quality gates, production monitoring, baseline comparison, and outcome measurement for agent pipelines.

$ git log --oneline --stat
stars:15.902
forks:1.286
updated:19 de maio de 2026 às 06:08
Explorador de arquivos
3 arquivos
SKILL.md
readonly