en un clic
dataset-validate
// Use this when the project needs a dedicated data-quality review before model review. Checks data reality, split correctness, label health, leakage risk, shape consistency, and mock-data disclosure.
// Use this when the project needs a dedicated data-quality review before model review. Checks data reality, split correctness, label health, leakage risk, shape consistency, and mock-data disclosure.
Use this when the user needs to choose between multiple ML routes after survey but before committing to implementation. Compares candidate approaches, selects one, records rejected routes, and keeps a fallback.
Use this when the project needs real baseline results before or alongside the main model. Runs classical or literature-aligned baselines under the same protocol and writes a reproducible baseline summary.
Use this when the user wants a draft paper, figure bundle, README, release page, or experiment artifact reviewed before sharing. Checks evidence binding, claim scope, captions, layout clarity, and release readiness.
Use this when the user wants to improve chart quality, standardize plotting style, regenerate release figures, or add captions/protocol notes. Normalizes fonts, colors, legends, units, and scope notes across Scientify figures.
Use this when the user wants to improve README, docs pages, or microsites so a new reader can understand what the project is, how to use it, what artifacts exist, and what the scope boundaries are within one screen.
[Read when prompt contains /research-experiment]
| name | dataset-validate |
| description | Use this when the project needs a dedicated data-quality review before model review. Checks data reality, split correctness, label health, leakage risk, shape consistency, and mock-data disclosure. |
| metadata | {"openclaw":{"emoji":"🗂️","requires":{"bins":["python3","uv"]}}} |
Don't ask permission. Just do it.
Use this skill before or alongside model implementation review when data quality needs to be checked separately from model quality.
Outputs go to the workspace root.
plan_res.md already existsplan_res.mdproject/ if a data pipeline already existssurvey_res.md when it defines dataset or protocol expectationsIf plan_res.md is missing, stop and say: Run /research-plan first to complete the implementation plan.
data_validation.mdRead:
plan_res.mdsurvey_res.md if presentproject/data/ if presentExtract:
Check:
Check:
plan_res.mdIf code exists, run lightweight inspection commands under the project environment to verify counts and sample structure.
data_validation.mdUse references/data-validation-template.md.
The report must include:
PASS, NEEDS_REVISION, or BLOCKED