Skip to main content
在 Manus 中运行任何 Skill
一键导入

eval-llm

// Evaluate a change to the LLM extraction pipeline against a clean baseline. Use when the user wants to test, validate, or assess a prompt/chunking/post-processing/provider change before committing. Produces a merge / regression / defer verdict.

$ git log --oneline --stat
stars:14,065
forks:844
updated:2026年4月27日 19:34
SKILL.md
readonly