with one click
baseline-runner
// Use this when the project needs real baseline results before or alongside the main model. Runs classical or literature-aligned baselines under the same protocol and writes a reproducible baseline summary.
// Use this when the project needs real baseline results before or alongside the main model. Runs classical or literature-aligned baselines under the same protocol and writes a reproducible baseline summary.
Use this when the user needs to choose between multiple ML routes after survey but before committing to implementation. Compares candidate approaches, selects one, records rejected routes, and keeps a fallback.
Use this when the project needs a dedicated data-quality review before model review. Checks data reality, split correctness, label health, leakage risk, shape consistency, and mock-data disclosure.
Use this when the user wants a draft paper, figure bundle, README, release page, or experiment artifact reviewed before sharing. Checks evidence binding, claim scope, captions, layout clarity, and release readiness.
Use this when the user wants to improve chart quality, standardize plotting style, regenerate release figures, or add captions/protocol notes. Normalizes fonts, colors, legends, units, and scope notes across Scientify figures.
Use this when the user wants to improve README, docs pages, or microsites so a new reader can understand what the project is, how to use it, what artifacts exist, and what the scope boundaries are within one screen.
[Read when prompt contains /research-experiment]
| name | baseline-runner |
| description | Use this when the project needs real baseline results before or alongside the main model. Runs classical or literature-aligned baselines under the same protocol and writes a reproducible baseline summary. |
| metadata | {"openclaw":{"emoji":"📏","requires":{"bins":["python3","uv"]}}} |
Don't ask permission. Just do it.
Use this skill when the project needs trustworthy baseline numbers instead of only evaluating the proposed model in isolation.
Outputs go to the workspace root.
plan_res.md already names baselinesproject/ already exists or a baseline implementation path is knownplan_res.mdsurvey_res.mdproject/ when the current project already has runnable codeIf plan_res.md is missing, stop and say: Run /research-plan first to complete the implementation plan.
baseline_res.mdexperiments/baselines/ when runnable artifacts are createdRead:
plan_res.mdsurvey_res.mdexperiment_res.md if it existsExtract:
Create a small comparison matrix with:
ready, needs adaptation, or missingUse references/baseline-matrix-template.md.
For each baseline:
All numeric results must come from actual execution logs or explicit imported evidence.
baseline_res.mdUse references/baseline-report-template.md.
The report must include: