Run any Skill in Manus with one click

$pwd:

baseline-runner

Name: Baseline Runner
Author: tsingyuai

// Use this when the project needs real baseline results before or alongside the main model. Runs classical or literature-aligned baselines under the same protocol and writes a reproducible baseline summary.

Run Skill in Manus

$ git log --oneline --stat

stars:532

forks:50

updated:April 3, 2026 at 07:59

File Explorer

3 files

SKILL.md

readonly

name	baseline-runner
description	Use this when the project needs real baseline results before or alongside the main model. Runs classical or literature-aligned baselines under the same protocol and writes a reproducible baseline summary.
metadata	{"openclaw":{"emoji":"📏","requires":{"bins":["python3","uv"]}}}

Baseline Runner

Don't ask permission. Just do it.

Use this skill when the project needs trustworthy baseline numbers instead of only evaluating the proposed model in isolation.

Outputs go to the workspace root.

Use This When

plan_res.md already names baselines
project/ already exists or a baseline implementation path is known
the experiment stage needs matched comparison numbers

Do Not Use This When

the project has not finished survey or planning
no baseline method has been identified yet

Required Inputs

plan_res.md
survey_res.md
project/ when the current project already has runnable code

If plan_res.md is missing, stop and say: Run /research-plan first to complete the implementation plan.

Required Outputs

baseline_res.md
experiments/baselines/ when runnable artifacts are created

Workflow

Step 1: Read the Evaluation Contract

Read:

plan_res.md
survey_res.md
current experiment_res.md if it exists

Extract:

baseline names
evaluation metric
protocol or guardrail
dataset or workload assumptions

Step 2: Define the Baseline Matrix

Create a small comparison matrix with:

baseline name
source or basis
expected setup
metric
status: ready, needs adaptation, or missing

Use references/baseline-matrix-template.md.

Step 3: Run or Approximate Baselines Conservatively

For each baseline:

if code is runnable under the current workspace, run it
if only a lightweight adaptation is needed, implement the minimal adapter
if a baseline cannot be run honestly, mark it as unavailable instead of inventing numbers

All numeric results must come from actual execution logs or explicit imported evidence.

Step 4: Write `baseline_res.md`

Use references/baseline-report-template.md.

The report must include:

which baselines were attempted
which ones ran successfully
the exact metric values
the evaluation protocol
missing or partial baselines
the most comparable baseline for the current project

Rules

Never fabricate baseline numbers.
Keep the protocol aligned with the main experiment whenever possible.
If a baseline is only partly comparable, say so explicitly.
Prefer 2-3 strong baselines over a long weak list.

related-skills.json

same repository

algorithm-selection.md

from "tsingyuai/scientify"

Use this when the user needs to choose between multiple ML routes after survey but before committing to implementation. Compares candidate approaches, selects one, records rejected routes, and keeps a fallback.

2026-04-03532

dataset-validate.md

from "tsingyuai/scientify"

Use this when the project needs a dedicated data-quality review before model review. Checks data reality, split correctness, label health, leakage risk, shape consistency, and mock-data disclosure.

2026-04-03532

artifact-review.md

from "tsingyuai/scientify"

Use this when the user wants a draft paper, figure bundle, README, release page, or experiment artifact reviewed before sharing. Checks evidence binding, claim scope, captions, layout clarity, and release readiness.

2026-04-03532

figure-standardize.md

from "tsingyuai/scientify"

Use this when the user wants to improve chart quality, standardize plotting style, regenerate release figures, or add captions/protocol notes. Normalizes fonts, colors, legends, units, and scope notes across Scientify figures.

2026-04-03532

release-layout.md

from "tsingyuai/scientify"

Use this when the user wants to improve README, docs pages, or microsites so a new reader can understand what the project is, how to use it, what artifacts exist, and what the scope boundaries are within one screen.

2026-04-03532

research-experiment.md

from "tsingyuai/scientify"

[Read when prompt contains /research-experiment]

2026-04-03532

package.json

"author": "tsingyuai"

"repository": "tsingyuai/scientify"

View GitHub Repository View Creator Repositories

$ install --global

$ download --local

Run Skill in Manus

$ useful --forSOC

Data ScientistsComputer and Mathematical Occupations15-2051L4

name	baseline-runner
description	Use this when the project needs real baseline results before or alongside the main model. Runs classical or literature-aligned baselines under the same protocol and writes a reproducible baseline summary.
metadata	{"openclaw":{"emoji":"📏","requires":{"bins":["python3","uv"]}}}

Baseline Runner

Don't ask permission. Just do it.

Use this skill when the project needs trustworthy baseline numbers instead of only evaluating the proposed model in isolation.

Outputs go to the workspace root.

Use This When

plan_res.md already names baselines
project/ already exists or a baseline implementation path is known
the experiment stage needs matched comparison numbers

Do Not Use This When

the project has not finished survey or planning
no baseline method has been identified yet

Required Inputs

plan_res.md
survey_res.md
project/ when the current project already has runnable code

If plan_res.md is missing, stop and say: Run /research-plan first to complete the implementation plan.

Required Outputs

baseline_res.md
experiments/baselines/ when runnable artifacts are created

Workflow

Step 1: Read the Evaluation Contract

Read:

plan_res.md
survey_res.md
current experiment_res.md if it exists

Extract:

baseline names
evaluation metric
protocol or guardrail
dataset or workload assumptions

Step 2: Define the Baseline Matrix

Create a small comparison matrix with:

baseline name
source or basis
expected setup
metric
status: ready, needs adaptation, or missing

Use references/baseline-matrix-template.md.

Step 3: Run or Approximate Baselines Conservatively

For each baseline:

if code is runnable under the current workspace, run it
if only a lightweight adaptation is needed, implement the minimal adapter
if a baseline cannot be run honestly, mark it as unavailable instead of inventing numbers

All numeric results must come from actual execution logs or explicit imported evidence.

Step 4: Write `baseline_res.md`

Use references/baseline-report-template.md.

The report must include:

which baselines were attempted
which ones ran successfully
the exact metric values
the evaluation protocol
missing or partial baselines
the most comparable baseline for the current project

Rules

Never fabricate baseline numbers.
Keep the protocol aligned with the main experiment whenever possible.
If a baseline is only partly comparable, say so explicitly.
Prefer 2-3 strong baselines over a long weak list.

baseline-runner

Baseline Runner

Use This When

Do Not Use This When

Required Inputs

Required Outputs

Workflow

Step 1: Read the Evaluation Contract

Step 2: Define the Baseline Matrix

Step 3: Run or Approximate Baselines Conservatively

Step 4: Write baseline_res.md

Rules

More from this repository

More from this repository

Baseline Runner

Use This When

Do Not Use This When

Required Inputs

Required Outputs

Workflow

Step 1: Read the Evaluation Contract

Step 2: Define the Baseline Matrix

Step 3: Run or Approximate Baselines Conservatively

Step 4: Write baseline_res.md

Rules

Step 4: Write `baseline_res.md`

Step 4: Write `baseline_res.md`