Skip to main content
Run any Skill in Manus
with one click
$pwd:

evaluate-model

// Load the latest model checkpoint, run evaluation on the test set, and generate a metrics report with confusion matrix. Use this after training to assess model performance or to re-evaluate a specific checkpoint.

$ git log --oneline --stat
stars:41
forks:6
updated:February 23, 2026 at 03:44
SKILL.md
readonly