Skip to main content
在 Manus 中运行任何 Skill
一键导入
$pwd:

evaluate-model

// Load the latest model checkpoint, run evaluation on the test set, and generate a metrics report with confusion matrix. Use this after training to assess model performance or to re-evaluate a specific checkpoint.

$ git log --oneline --stat
stars:41
forks:6
updated:2026年2月23日 03:44
SKILL.md
readonly