Run any Skill in Manus with one click

$pwd:

add-rollout-function

Name: Add Rollout Function
Author: THUDM

// Guide for adding a new rollout function in slime and wiring it through --rollout-function-path. Use when user wants to implement custom rollout data generation logic, custom train/eval rollout outputs, or migrate from the default sglang rollout path.

Run Skill in Manus

$ git log --oneline --stat

stars:5,863

forks:844

updated:March 2, 2026 at 03:01

SKILL.md

readonly

name	add-rollout-function
description	Guide for adding a new rollout function in slime and wiring it through --rollout-function-path. Use when user wants to implement custom rollout data generation logic, custom train/eval rollout outputs, or migrate from the default sglang rollout path.

Add Rollout Function

Implement a custom rollout function and integrate it safely with slime training/eval flow.

When to Use

Use this skill when:

User asks to add a new rollout task or rollout generation function
User asks to replace default slime.rollout.sglang_rollout.generate_rollout
User asks to customize train/eval data generation behavior

Step-by-Step Guide

Step 1: Choose the Right Starting Point

Start from one of these references:

Async RL-style rollout: slime/rollout/sglang_rollout.py
Simple SFT-style rollout: slime/rollout/sft_rollout.py

If the task needs engine-based async generation and rewards, use the sglang path as base. If the task is file/buffer-driven and simple, use sft path as base.

Step 2: Create the New Rollout Module

Create a new file, for example: slime/rollout/<your_rollout>.py

Required callable signature:

def generate_rollout(args, rollout_id, data_source, evaluation=False) -> RolloutFnTrainOutput | RolloutFnEvalOutput:
    ...

Return types are defined in slime/rollout/base_types.py.

Step 3: Implement Train and Eval Branches Explicitly

For training (evaluation=False), return RolloutFnTrainOutput(samples=..., metrics=...)
For evaluation (evaluation=True), return RolloutFnEvalOutput(data=..., metrics=...)

Minimal skeleton:

from slime.rollout.base_types import RolloutFnTrainOutput, RolloutFnEvalOutput


def generate_rollout(args, rollout_id, data_source, evaluation=False):
    if evaluation:
        result = {
            "custom_eval": {
                "rewards": [],
                "truncated": [],
                "samples": [],
            }
        }
        return RolloutFnEvalOutput(data=result)

    groups = data_source.get_samples(args.rollout_batch_size)
    # fill Sample fields needed by training: tokens/response_length/reward/status (+ loss_mask when needed)
    return RolloutFnTrainOutput(samples=groups)

Step 4: Keep Data Contract Compatible

For each generated sample, ensure required training fields are populated consistently with your objective:

tokens
response_length
reward (or reward dict if your setup uses keyed rewards)
status

If partial rollout or masking logic is involved, keep loss_mask semantics consistent with existing behavior.

Step 5: Wire Through Arguments

Set your function path via CLI:

--rollout-function-path slime.rollout.<your_rollout>.generate_rollout

The default and signature expectation are documented in:

slime/utils/arguments.py
docs/en/get_started/customization.md

Common Mistakes

Returning raw Python lists/dicts with mismatched schema in custom path
Implementing only training branch and forgetting evaluation branch
Generating samples without required fields (tokens, response_length, reward, status)
Using blocking-heavy logic in high-frequency rollout paths without batching/concurrency control

Reference Locations

Default rollout: slime/rollout/sglang_rollout.py
Simple custom example: slime/rollout/sft_rollout.py
Output dataclasses: slime/rollout/base_types.py
Wiring/loading: slime/ray/rollout.py
Argument definition: slime/utils/arguments.py
Customization docs: docs/en/get_started/customization.md

related-skills.json

same repository

add-dynamic-filter.md

from "THUDM/slime"

Guide for adding dynamic/filter hooks in slime rollout pipeline. Use when user wants sample-group selection during rollout, buffer filtering before training, or per-sample masking/processing hooks.

2026-03-025.9k

add-eval-dataset-config.md

from "THUDM/slime"

Guide for adding and validating evaluation dataset configuration in slime. Use when user wants to configure eval datasets via --eval-config or --eval-prompt-data, add per-dataset overrides, or customize evaluation rollout behavior.

2026-03-025.9k

add-reward-function.md

from "THUDM/slime"

Guide for adding a custom reward function in slime and wiring it through --custom-rm-path (and optional reward post-processing). Use when user wants new reward logic, remote/service reward integration, or task-specific reward shaping.

2026-03-025.9k

add-tests-and-ci.md

from "THUDM/slime"

Guide for adding or updating slime tests and CI wiring. Use when tasks require new test cases, CI registration, test matrix updates, or workflow template changes.

2026-03-025.9k

package.json

"author": "THUDM"

"repository": "THUDM/slime"

View GitHub Repository View Creator Repositories

$ install --global

$ download --local

Run Skill in Manus

$ useful --forSOC

Data ScientistsComputer and Mathematical Occupations15-2051L4

Software DevelopersL4

name	add-rollout-function
description	Guide for adding a new rollout function in slime and wiring it through --rollout-function-path. Use when user wants to implement custom rollout data generation logic, custom train/eval rollout outputs, or migrate from the default sglang rollout path.

Add Rollout Function

Implement a custom rollout function and integrate it safely with slime training/eval flow.

When to Use

Use this skill when:

User asks to add a new rollout task or rollout generation function
User asks to replace default slime.rollout.sglang_rollout.generate_rollout
User asks to customize train/eval data generation behavior

Step-by-Step Guide

Step 1: Choose the Right Starting Point

Start from one of these references:

Async RL-style rollout: slime/rollout/sglang_rollout.py
Simple SFT-style rollout: slime/rollout/sft_rollout.py

If the task needs engine-based async generation and rewards, use the sglang path as base. If the task is file/buffer-driven and simple, use sft path as base.

Step 2: Create the New Rollout Module

Create a new file, for example: slime/rollout/<your_rollout>.py

Required callable signature:

def generate_rollout(args, rollout_id, data_source, evaluation=False) -> RolloutFnTrainOutput | RolloutFnEvalOutput:
    ...

Return types are defined in slime/rollout/base_types.py.

Step 3: Implement Train and Eval Branches Explicitly

For training (evaluation=False), return RolloutFnTrainOutput(samples=..., metrics=...)
For evaluation (evaluation=True), return RolloutFnEvalOutput(data=..., metrics=...)

Minimal skeleton:

from slime.rollout.base_types import RolloutFnTrainOutput, RolloutFnEvalOutput


def generate_rollout(args, rollout_id, data_source, evaluation=False):
    if evaluation:
        result = {
            "custom_eval": {
                "rewards": [],
                "truncated": [],
                "samples": [],
            }
        }
        return RolloutFnEvalOutput(data=result)

    groups = data_source.get_samples(args.rollout_batch_size)
    # fill Sample fields needed by training: tokens/response_length/reward/status (+ loss_mask when needed)
    return RolloutFnTrainOutput(samples=groups)

Step 4: Keep Data Contract Compatible

For each generated sample, ensure required training fields are populated consistently with your objective:

tokens
response_length
reward (or reward dict if your setup uses keyed rewards)
status

If partial rollout or masking logic is involved, keep loss_mask semantics consistent with existing behavior.

Step 5: Wire Through Arguments

Set your function path via CLI:

--rollout-function-path slime.rollout.<your_rollout>.generate_rollout

The default and signature expectation are documented in:

slime/utils/arguments.py
docs/en/get_started/customization.md

Common Mistakes

Returning raw Python lists/dicts with mismatched schema in custom path
Implementing only training branch and forgetting evaluation branch
Generating samples without required fields (tokens, response_length, reward, status)
Using blocking-heavy logic in high-frequency rollout paths without batching/concurrency control

Reference Locations

Default rollout: slime/rollout/sglang_rollout.py
Simple custom example: slime/rollout/sft_rollout.py
Output dataclasses: slime/rollout/base_types.py
Wiring/loading: slime/ray/rollout.py
Argument definition: slime/utils/arguments.py
Customization docs: docs/en/get_started/customization.md

add-rollout-function

Add Rollout Function

When to Use

Step-by-Step Guide

Step 1: Choose the Right Starting Point

Step 2: Create the New Rollout Module

Step 3: Implement Train and Eval Branches Explicitly

Step 4: Keep Data Contract Compatible

Step 5: Wire Through Arguments

Common Mistakes

Reference Locations

More from this repository

More from this repository

Add Rollout Function

When to Use

Step-by-Step Guide

Step 1: Choose the Right Starting Point

Step 2: Create the New Rollout Module

Step 3: Implement Train and Eval Branches Explicitly

Step 4: Keep Data Contract Compatible

Step 5: Wire Through Arguments

Common Mistakes

Reference Locations