Run any Skill in Manus with one click

ablation-planner

Use when main results pass result-to-claim (claim_supported=yes or partial) and ablation studies are needed for paper submission. A secondary Codex reviewer designs ablations from a reviewer's perspective, while the main agent checks feasibility and implementation.

Run Skill in Manus

Overview

Install command

npx skills add https://github.com/tqLi99/claude-skills-for-writing --skill ablation-planner

Copy and paste this command into Claude Code to install the skill

Source

tqLi99/claude-skills-for-writing

Stars0

Forks0

UpdatedApril 2, 2026 at 02:21

SKILL.md

readonly

name	ablation-planner
description	Use when main results pass result-to-claim (claim_supported=yes or partial) and ablation studies are needed for paper submission. A secondary Codex reviewer designs ablations from a reviewer's perspective, while the main agent checks feasibility and implementation.
argument-hint	["method-or-claim-scope"]
allowed-tools	Bash(*), Read, Write, Edit, Grep, Glob, Agent, mcp__codex__codex, mcp__codex__codex-reply

Ablation Planner

Systematically design ablation studies that answer the questions reviewers will ask.

Context: $ARGUMENTS

When to Use

Main results pass /result-to-claim with claim_supported = yes or partial
The user explicitly requests ablation planning
/auto-review-loop identifies missing ablations

Automation Policy

AUTO_PROCEED = false — Present the ablation plan and compute estimate before launching implementation.
HUMAN_CHECKPOINT = true — For expensive ablation suites, require explicit approval of cuts and run order.

Resolve automation defaults in this precedence order:

Inline command arguments
PROJECT_AUTOMATION.md in the project root
CLAUDE.md in the project root
The constants in this skill

Workflow

Before invoking the secondary reviewer, read ../shared-references/agent-role-charter.md and apply the Ablation Reviewer role.

Step 1: Prepare Context

Read available project files to build the full picture:

Method description and components from docs/research_contract.md, project notes, or AGENTS.md
Current experiment results from EXPERIMENT_LOG.md, EXPERIMENT_TRACKER.md, or W&B
Confirmed and intended claims from result-to-claim output or project notes
Available compute resources from project notes or environment config

Step 2: Secondary Reviewer Designs Ablations

spawn_agent:
  model: gpt-5.4
  reasoning_effort: xhigh
  message: |
    You are a skeptical senior reviewer planning ablation studies for a paper in
    multi-agent control / robotics / learning systems. Your job is to propose the
    minimum decisive ablations needed to survive peer review.

    Given this method and results, design ablations that:

    1. Isolate the contribution of each novel component
    2. Answer questions reviewers will definitely ask
    3. Test sensitivity to key hyperparameters
    4. Compare against natural alternative design choices

    Method: [description from project files]
    Components: [list of removable/replaceable components]
    Current results: [key metrics from experiments]
    Claims: [what we claim and current evidence]

    For each ablation, specify:
    - name: what to change
    - what_it_tests: the specific question this answers
    - expected_if_component_matters: what we predict if the component is important
    - priority: 1 (must-run) to 5 (nice-to-have)

    Also provide:
    - coverage_assessment
    - unnecessary_ablations
    - suggested_order
    - estimated_compute

    Do not pad the plan with ornamental sweeps. Favor reviewer-facing ablations that
    clearly sharpen the paper's claim boundaries.

Step 3: Parse Ablation Plan

Normalize the response into:

## Ablation Plan

### Component Ablations
| # | Name | What It Tests | Expected If Matters | Priority |
|---|------|---------------|---------------------|----------|

### Hyperparameter Sensitivity
| # | Parameter | Values to Test | What It Tests | Priority |
|---|-----------|---------------|---------------|----------|

### Design Choice Comparisons
| # | Name | What It Tests | Priority |
|---|------|---------------|----------|

### Coverage Assessment
[what reviewer questions these ablations answer]

### Unnecessary Ablations
[what to skip]

### Run Order
[optimized order]

### Estimated Compute
[total GPU-hours]

Step 4: Review Feasibility

Before running anything, check:

Compute budget
Which ablations are config-only vs code-change
Which ablations can run in parallel
What should be cut first if budget is too tight

Step 5: Implement and Run

Create configs/scripts for each ablation
Smoke test each ablation before the full run
Run in suggested order with descriptive names
Track results in EXPERIMENT_LOG.md
After completion, update findings with the ablation insights

Rules

The secondary reviewer leads the ablation design. Do not pre-filter the ablation list before the reviewer sees it.
Every ablation must have a clear what_it_tests and expected_if_component_matters.
Config-only ablations take priority over ablations that require code changes.
If total compute exceeds budget, propose cuts explicitly instead of silently dropping ablations.
Component ablations take priority over broad hyperparameter sweeps.
Record all ablation results, including negative ones.

More from this repository

same repository

auto-paper-improvement-loop

tqLi99/claude-skills-for-writing

Autonomously improve a generated paper via GPT-5.4 xhigh review → implement fixes → recompile, for 2 rounds. Use when user says "改论文", "improve paper", "论文润色循环", "auto improve", or wants to iteratively polish a generated paper.

2026-04-020

auto-review-loop

tqLi99/claude-skills-for-writing

Autonomous multi-round research review loop. Repeatedly reviews via Codex MCP, implements fixes, and re-reviews until positive assessment or max rounds reached. Use when user says "auto review loop", "review until it passes", or wants autonomous iterative improvement.

2026-04-020

comm-lit-review

tqLi99/claude-skills-for-writing

Communications-domain literature review and related-work search with database-aware source control. Use when the task is about communications, wireless, networking, satellite/NTN, Wi-Fi, cellular, transport protocols, congestion control, routing, scheduling, MAC/PHY, rate adaptation, channel estimation, beamforming, or communication-system research and the user wants papers, prior art, a survey, related work, or a landscape summary. Prioritize IEEE Xplore and ScienceDirect, prefer formal publications over preprints, and separate foundational work from recent progress.

2026-04-020

experiment-bridge

tqLi99/claude-skills-for-writing

Workflow 1.5: Bridge between idea discovery and auto review. Reads EXPERIMENT_PLAN.md, implements experiment code, deploys to GPU, collects initial results. Use when user says "实现实验", "implement experiments", "bridge", "从计划到跑实验", "deploy the plan", or has an experiment plan ready to execute.

2026-04-020

experiment-pipeline

tqLi99/claude-skills-for-writing

Stage 2 of the research workflow: turn a validated idea into implemented experiments, completed runs, analyzed results, and a writing-ready narrative package. Use when the user wants the experiment stage only.

2026-04-020

experiment-plan

tqLi99/claude-skills-for-writing

Turn a refined research proposal or method idea into a detailed, claim-driven experiment roadmap. Use after `research-refine`, or when the user asks for a detailed experiment plan, ablation matrix, evaluation protocol, run order, compute budget, or paper-ready validation that supports the core problem, novelty, simplicity, and any LLM / VLM / Diffusion / RL-based contribution.

2026-04-020

Source

tqLi99

tqLi99/claude-skills-for-writing

View GitHub Repository View Creator Repositories

Install command

Download

Run Skill in Manus

Useful forSOC

Engineering Teachers, PostsecondaryEducational Instruction and Library Occupations25-1032L4

name	ablation-planner
description	Use when main results pass result-to-claim (claim_supported=yes or partial) and ablation studies are needed for paper submission. A secondary Codex reviewer designs ablations from a reviewer's perspective, while the main agent checks feasibility and implementation.
argument-hint	["method-or-claim-scope"]
allowed-tools	Bash(*), Read, Write, Edit, Grep, Glob, Agent, mcp__codex__codex, mcp__codex__codex-reply

Ablation Planner

Systematically design ablation studies that answer the questions reviewers will ask.

Context: $ARGUMENTS

When to Use

Main results pass /result-to-claim with claim_supported = yes or partial
The user explicitly requests ablation planning
/auto-review-loop identifies missing ablations

Automation Policy

AUTO_PROCEED = false — Present the ablation plan and compute estimate before launching implementation.
HUMAN_CHECKPOINT = true — For expensive ablation suites, require explicit approval of cuts and run order.

Resolve automation defaults in this precedence order:

Inline command arguments
PROJECT_AUTOMATION.md in the project root
CLAUDE.md in the project root
The constants in this skill

Workflow

Before invoking the secondary reviewer, read ../shared-references/agent-role-charter.md and apply the Ablation Reviewer role.

Step 1: Prepare Context

Read available project files to build the full picture:

Method description and components from docs/research_contract.md, project notes, or AGENTS.md
Current experiment results from EXPERIMENT_LOG.md, EXPERIMENT_TRACKER.md, or W&B
Confirmed and intended claims from result-to-claim output or project notes
Available compute resources from project notes or environment config

Step 2: Secondary Reviewer Designs Ablations

spawn_agent:
  model: gpt-5.4
  reasoning_effort: xhigh
  message: |
    You are a skeptical senior reviewer planning ablation studies for a paper in
    multi-agent control / robotics / learning systems. Your job is to propose the
    minimum decisive ablations needed to survive peer review.

    Given this method and results, design ablations that:

    1. Isolate the contribution of each novel component
    2. Answer questions reviewers will definitely ask
    3. Test sensitivity to key hyperparameters
    4. Compare against natural alternative design choices

    Method: [description from project files]
    Components: [list of removable/replaceable components]
    Current results: [key metrics from experiments]
    Claims: [what we claim and current evidence]

    For each ablation, specify:
    - name: what to change
    - what_it_tests: the specific question this answers
    - expected_if_component_matters: what we predict if the component is important
    - priority: 1 (must-run) to 5 (nice-to-have)

    Also provide:
    - coverage_assessment
    - unnecessary_ablations
    - suggested_order
    - estimated_compute

    Do not pad the plan with ornamental sweeps. Favor reviewer-facing ablations that
    clearly sharpen the paper's claim boundaries.

Step 3: Parse Ablation Plan

Normalize the response into:

## Ablation Plan

### Component Ablations
| # | Name | What It Tests | Expected If Matters | Priority |
|---|------|---------------|---------------------|----------|

### Hyperparameter Sensitivity
| # | Parameter | Values to Test | What It Tests | Priority |
|---|-----------|---------------|---------------|----------|

### Design Choice Comparisons
| # | Name | What It Tests | Priority |
|---|------|---------------|----------|

### Coverage Assessment
[what reviewer questions these ablations answer]

### Unnecessary Ablations
[what to skip]

### Run Order
[optimized order]

### Estimated Compute
[total GPU-hours]

Step 4: Review Feasibility

Before running anything, check:

Compute budget
Which ablations are config-only vs code-change
Which ablations can run in parallel
What should be cut first if budget is too tight

Step 5: Implement and Run

Create configs/scripts for each ablation
Smoke test each ablation before the full run
Run in suggested order with descriptive names
Track results in EXPERIMENT_LOG.md
After completion, update findings with the ablation insights

Rules

The secondary reviewer leads the ablation design. Do not pre-filter the ablation list before the reviewer sees it.
Every ablation must have a clear what_it_tests and expected_if_component_matters.
Config-only ablations take priority over ablations that require code changes.
If total compute exceeds budget, propose cuts explicitly instead of silently dropping ablations.
Component ablations take priority over broad hyperparameter sweeps.
Record all ablation results, including negative ones.