| name | skill-rm-unifying-reward-modeling |
| description | Skill Reward Model (Skill-RM) framework for unified reward modeling in RL pipelines. Treats reward computation as structured agentic task orchestrating heterogeneous evaluation criteria. |
Skill-RM: Unified Reward Modeling via Agent Skill
Core Concept
Reformulate reward modeling as execution of reusable Reward-Evaluation Skill. Instead of static evaluation, treat reward computation as structured agentic task that dynamically selects and aggregates evidence.
Key Innovation
- Unified interface: rule-based verifiers, ground-truth references, procedural checklists, complex rubrics
- Dynamic orchestration: Select evidence tailored to each input's requirements
- Transparency & consistency: Beyond static evaluation, structured task execution
Implementation
- Reward-Evaluation Skill: Reusable skill module for reward computation
- Evidence aggregation: Dynamic selection based on input requirements
- Integration: RL pipelines, RFT, best-of-N selection
Applications
- RLHF pipelines
- Reinforced fine-tuning (RFT)
- Best-of-N selection
- Reward benchmarks
Results
Consistently outperforms traditional judge baselines on reward benchmarks and downstream RL applications.
Source
Activation Keywords
skill-rm, reward modeling, heterogeneous evaluation, RLHF reward, reward orchestration, agentic reward