Skip to main content
Jeden Skill in Manus ausführen
mit einem Klick
$pwd:

evaluate-model

// Load the latest model checkpoint, run evaluation on the test set, and generate a metrics report with confusion matrix. Use this after training to assess model performance or to re-evaluate a specific checkpoint.

$ git log --oneline --stat
stars:41
forks:6
updated:23. Februar 2026 um 03:44
SKILL.md
readonly