Skip to main content
Run any Skill in Manus
with one click

score-eval-attempt

// Score a completed programming evaluation attempt against a benchmark scorecard. Use when the user wants to evaluate a generated model attempt, produce eval-results.csv, inspect build/test/runtime behavior, or compare attempts using the scorecard created by design-eval-case. Triggers on phrases like "score eval attempt", "ocen probe", "evaluate attempt", "score model output", "wygeneruj eval-results", "ocen wynik modelu", or requests to grade an eval-attempts directory.

$ git log --oneline --stat
stars:5
forks:4
updated:May 6, 2026 at 10:55
SKILL.md
readonly