Skip to main content
在 Manus 中运行任何 Skill
一键导入
$pwd:

horangi-fails

// Deep-dive error pattern analysis for a single (model, benchmark) pair using Weave traces. Surfaces how/why the model is getting answers wrong — answer bias, format violations, language mixing, and 3-5 representative failure samples. Invoke when the user asks to analyze wrong answers / failure patterns for a specific benchmark (e.g. "analyze errors in <bench>", "<model>의 <benchmark> 오답 패턴 분석", "틀린 문제 경향"). Commonly invoked as a follow-up to `horangi-analyze` when that skill flags a weak category.

$ git log --oneline --stat
stars:3
forks:0
updated:2026年4月15日 03:48
SKILL.md
readonly