Skip to main content
Exécutez n'importe quel Skill dans Manus
en un clic
$pwd:

benchmark-report-writer

// Produces publishable markdown benchmark reports from eval results, modeled on the structure and tone of frontier AI lab reports (Anthropic, Cursor, FrontierSWE, BaxBench). Use this when the user wants to write up benchmark findings, turn eval results into a blog post or technical report, publish a model/agent comparison, announce a new eval, or share benchmark data publicly. Triggers on phrases like "write a benchmark report", "turn these results into a post", "publish our evals", "draft a writeup of the benchmark", "create a model comparison report", "generate a benchmark blog post". Use this skill even if the user just says "write this up" or "make a report" in the context of benchmark/eval results. Do NOT use for internal changelog entries, release notes, PR descriptions, or academic paper drafts.

$ git log --oneline --stat
stars:1
forks:1
updated:24 avril 2026 à 08:15
Explorateur de fichiers
6 fichiers
SKILL.md
readonly