Skip to main content
在 Manus 中运行任何 Skill
一键导入

skill-eval

// Run the agentic evaluation repo for a target skill. Use when asked to execute repo-defined suites, collect evidence, write per-case results, and produce a short audit report for the target skill. Also supports evaluation modes: AB tests, subjective scoring, and vendor comparisons via evaluation YAML files in the eval repo.

$ git log --oneline --stat
stars:1
forks:0
updated:2026年5月22日 20:36
文件资源管理器
18 个文件
SKILL.md
readonly