gsm8k-eval

// GSM8K evaluation protocol: answer extraction (####, \boxed, CoT), accuracy scoring, prompt formatting, few-shot exemplars, dataset loading, pitfalls. Use when: GSM8K, grade school math, openai/gsm8k, #### delimiter, parse_gsm8k_answer, detect_answer_failure, load_gsm8k, format_chat, math benchmark scoring, gsm8k few-shot, chain-of-thought eval.
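As a rough illustration of the extraction and scoring steps named in the description, here is a minimal sketch. It is not the skill's own code: `extract_answer` and `accuracy` are hypothetical names, and the fallback order (`####` first, then `\boxed{...}`, then the last number in the text) is an assumption about how such a protocol could be arranged.

```python
import re

# Integer or decimal, optionally negative, optionally with thousands separators.
_NUM = r"-?\d[\d,]*(?:\.\d+)?"


def extract_answer(text):
    """Return the final numeric answer as a string without commas, or None."""
    # 1. GSM8K reference format: the answer follows a '####' delimiter.
    match = re.search(r"####\s*\$?(" + _NUM + ")", text)
    # 2. Assumed fallback: a LaTeX \boxed{...} answer.
    if match is None:
        match = re.search(r"\\boxed\{\s*\$?(" + _NUM + r")\s*\}", text)
    if match is not None:
        raw = match.group(1)
    else:
        # 3. Assumed fallback for free-form chain-of-thought: last number seen.
        numbers = re.findall(_NUM, text)
        if not numbers:
            return None
        raw = numbers[-1]
    return raw.replace(",", "")


def accuracy(predictions, references):
    """Fraction of predictions whose extracted number equals the reference's."""
    if not references:
        return 0.0
    correct = 0
    for pred_text, ref_text in zip(predictions, references):
        pred = extract_answer(pred_text)
        gold = extract_answer(ref_text)
        # A failed extraction on either side counts as incorrect.
        if pred is not None and gold is not None and float(pred) == float(gold):
            correct += 1
    return correct / len(references)
```

Comparing as floats treats `72` and `72.0` as the same answer; a stricter protocol could compare the normalized strings instead.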

stars: 2
forks: 0
updated: March 23, 2026 at 21:16