$pwd:

design-eval-case

// Design a stack-agnostic programming evaluation case for live AI coding evals. Use when the user wants to create a benchmark prompt, context, optional bootstrap or baseline instructions, and scorecard for any programming task, especially during webinars or workshops where participants choose the stack, task, constraints, and scoring criteria. Triggers on phrases like "design eval case", "zaprojektuj eval", "stworz benchmark case", "live eval design", "scorecard dla zadania", "programming eval", or requests to define an AI coding benchmark from scratch.

$ git log --oneline --stat
stars:5
forks:4
updated:May 6, 2026 at 13:39
SKILL.md