Skip to main content
Run any Skill in Manus
with one click

learning-zone-energy-data-selection

Learning-Zone Energy methodology — online data selection for efficient RL post-training of LLMs. Identifies the 'learning zone' where samples have optimal difficulty for gradient signal, replacing uniform rollout/gradient budgets in GRPO/DAPO. Use when: optimizing RL post-training compute, data selection for LLM RL, GRPO efficiency, DAPO optimization, RL training data prioritization, reasoning model post-training. Activation: learning zone energy, online data selection RL, GRPO data efficiency, RL post-training optimization, rollout budget allocation, medium-difficulty sample RL.

Overview

Learning-Zone Energy methodology — online data selection for efficient RL post-training of LLMs. Identifies the 'learning zone' where samples have optimal difficulty for gradient signal, replacing uniform rollout/gradient budgets in GRPO/DAPO. Use when: optimizing RL post-training compute, data selection for LLM RL, GRPO efficiency, DAPO optimization, RL training data prioritization, reasoning model post-training. Activation: learning zone energy, online data selection RL, GRPO data efficiency, RL post-training optimization, rollout budget allocation, medium-difficulty sample RL.

Install command
npx skills add https://github.com/hiyenwong/ai_collection --skill learning-zone-energy-data-selection

Copy and paste this command into Claude Code to install the skill

Stars1
Forks0
UpdatedJune 4, 2026 at 02:00
SKILL.md
readonly