Skip to main content
Ejecuta cualquier Skill en Manus
con un clic
$pwd:

gsm8k-eval

// GSM8K evaluation protocol: answer extraction (####, \boxed, CoT), accuracy scoring, prompt formatting, few-shot exemplars, dataset loading, pitfalls. Use when: GSM8K, grade school math, openai/gsm8k, #### delimiter, parse_gsm8k_answer, detect_answer_failure, load_gsm8k, format_chat, math benchmark scoring, gsm8k few-shot, chain-of-thought eval.

$ git log --oneline --stat
stars:2
forks:0
updated:23 de marzo de 2026, 21:16
Explorador de archivos
3 archivos
SKILL.md
readonly