
eval-harness

// Run the eval harness against a real Obsidian instance, monitor for the harness's known reliability gotchas (concurrent runs, CLI-bridge hangs, stale child processes), and bless a result as a baseline only when the run is clean. Use when the user asks to "run the evals", "bless a baseline", "measure the impact of <change> on solve rate", "run a model sweep", or similar. Has Obsidian-CLI side effects (drives the agent view, creates eval-scratch fixtures, may set chatModelName for the duration of the run); does NOT modify plugin source code or commit unless explicitly asked.

stars:453
forks:61
updated:May 22, 2026 at 21:59
SKILL.md