Skip to main content
تشغيل أي مهارة في Manus
بنقرة واحدة

evaluation

// Complete reference for the config-first model evaluation system. Covers the Evaluator CLI, assertion-driven YAML scenarios, response views, backend configuration, presets, scoring, LLM-as-judge, model comparison, and HuggingFace integration. Use when evaluating models, writing test prompts, comparing training runs, or interpreting eval results. This skill is about USING the evaluation system via CLI and YAML.

$ git log --oneline --stat
stars:٢٣
forks:٣
updated:٢٤ أبريل ٢٠٢٦ في ١٧:٥٦
مستكشف الملفات
6 ملفات
SKILL.md
readonly