Skip to main content
Ejecuta cualquier Skill en Manus
con un clic

byob

// Create custom LLM evaluation benchmarks using the BYOB decorator framework. Use when the user wants to (1) create a new benchmark from a dataset, (2) pick or write a scorer, (3) compile and run a BYOB benchmark, (4) containerize a benchmark, or (5) use LLM-as-Judge evaluation. Triggers on mentions of BYOB, custom benchmark, bring your own benchmark, scorer, or benchmark compilation.

$ git log --oneline --stat
stars:283
forks:48
updated:7 de mayo de 2026, 07:26
SKILL.md
readonly