Skip to main content
تشغيل أي مهارة في Manus
بنقرة واحدة

byob

// Create custom LLM evaluation benchmarks using the BYOB decorator framework. Use when the user wants to (1) create a new benchmark from a dataset, (2) pick or write a scorer, (3) compile and run a BYOB benchmark, (4) containerize a benchmark, or (5) use LLM-as-Judge evaluation. Triggers on mentions of BYOB, custom benchmark, bring your own benchmark, scorer, or benchmark compilation.

$ git log --oneline --stat
stars:٢٨٣
forks:٤٨
updated:٧ مايو ٢٠٢٦ في ٠٧:٢٦
SKILL.md
readonly