$pwd:

evaluate-model

// Load the latest model checkpoint, run evaluation on the test set, and generate a metrics report with confusion matrix. Use this after training to assess model performance or to re-evaluate a specific checkpoint.
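The report step described above can be sketched in plain Python. This is only an illustration, not the skill's actual implementation, which is not shown on this page. It assumes the checkpoint has already been loaded and run on the test set, so that true and predicted labels are available as two lists. The function names `confusion_matrix` and `metrics_report` are hypothetical.

```python
from collections import Counter


def confusion_matrix(y_true, y_pred, labels):
    """Build a confusion matrix: rows are true labels, columns are predicted labels."""
    counts = Counter(zip(y_true, y_pred))
    return [[counts[(t, p)] for p in labels] for t in labels]


def metrics_report(y_true, y_pred):
    """Hypothetical report: accuracy, per-class precision/recall, confusion matrix."""
    labels = sorted(set(y_true) | set(y_pred))
    matrix = confusion_matrix(y_true, y_pred, labels)
    total = len(y_true)
    correct = sum(matrix[i][i] for i in range(len(labels)))

    per_class = {}
    for i, label in enumerate(labels):
        tp = matrix[i][i]
        predicted = sum(row[i] for row in matrix)  # column sum
        actual = sum(matrix[i])                    # row sum
        per_class[label] = {
            "precision": tp / predicted if predicted else 0.0,
            "recall": tp / actual if actual else 0.0,
        }

    return {
        "labels": labels,
        "accuracy": correct / total if total else 0.0,
        "confusion_matrix": matrix,
        "per_class": per_class,
    }


if __name__ == "__main__":
    # Toy labels standing in for real test-set predictions (assumed data).
    report = metrics_report(
        ["cat", "cat", "dog", "dog"],
        ["cat", "dog", "dog", "dog"],
    )
    print(report)
```

In a real run, the two lists would come from evaluating the chosen checkpoint on the held-out test set; re-evaluating a specific checkpoint only changes where the predictions come from, not how the report is computed.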

$ git log --oneline --stat
stars: 41
forks: 6
updated: February 23, 2026 at 03:44
SKILL.md