$pwd:

agent-evaluation

// Use this when you need to evaluate, improve, or optimize an existing LLM agent's output quality, including improving tool selection accuracy or answer quality, reducing costs, or fixing cases where the agent gives wrong or incomplete responses. Evaluates agents systematically using MLflow evaluation with datasets, scorers, and tracing. IMPORTANT: always load the instrumenting-with-mlflow-tracing skill as well before starting any work. Covers the end-to-end evaluation workflow or its individual components (tracing setup, dataset creation, scorer definition, evaluation execution).

$ git log --oneline --stat
stars: 44
forks: 13
updated: March 31, 2026 at 16:28
File explorer
18 files
SKILL.md
readonly