ml-model-eval-benchmark

Stars: 1
Forks: 1
Updated: March 13, 2026 at 04:04

Compare model candidates using weighted metrics and deterministic ranking outputs. Use for benchmark leaderboards and model promotion decisions.
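
As an illustration of what weighted metrics with a deterministic ranking can look like, here is a minimal Python sketch. It is not taken from the skill itself: the function name `rank_models`, the metric names, the weights, and the tie-breaking rule (alphabetical by model name) are all assumptions made for this example.

```python
from typing import Dict, List, Tuple


def rank_models(
    candidates: Dict[str, Dict[str, float]],
    weights: Dict[str, float],
) -> List[Tuple[str, float]]:
    """Return (name, weighted_score) pairs, best first.

    Hypothetical helper: assumes every metric is "higher is better"
    and that each candidate reports every weighted metric.
    """
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("weights must sum to a positive number")

    scored = []
    for name, metrics in candidates.items():
        # Weighted average of the metrics, normalized by the total weight.
        score = sum(weights[m] * metrics[m] for m in weights) / total
        # Round so tiny floating-point differences do not reorder ties.
        scored.append((name, round(score, 6)))

    # Sort by score descending, then by name ascending, so that tied
    # candidates come out in the same order on every run.
    return sorted(scored, key=lambda pair: (-pair[1], pair[0]))


# Illustrative data only; these are not real benchmark results.
candidates = {
    "model_b": {"accuracy": 0.90, "f1": 0.80},
    "model_a": {"accuracy": 0.80, "f1": 0.90},
    "model_c": {"accuracy": 0.70, "f1": 0.70},
}
weights = {"accuracy": 0.5, "f1": 0.5}

leaderboard = rank_models(candidates, weights)
for position, (name, score) in enumerate(leaderboard, start=1):
    print(position, name, score)
```

Here `model_a` and `model_b` tie on the weighted score, and the secondary sort key keeps their order stable regardless of input order, which matters when the ranking feeds a promotion decision.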

Installation

Install with Codex or Claude

Copy this prompt, paste it into Codex, Claude, or another assistant, and let it review the skill page and install it for you.

SKILL.md