reward-uncertainty-diverse-behaviour

Reformulate the RL objective using a distribution over reward functions instead of a single scalar reward. Apply a non-linear objective over sets of actions to induce calibrated behavioural diversity without sacrificing expected reward.
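
As a rough illustration of the idea, the sketch below treats reward uncertainty as a handful of equally weighted sampled reward functions over a small discrete action space, and scores a set of actions by the expected best reward within the set. This is an assumed toy instance of a non-linear set objective, not the skill's own method. The names `set_value`, `greedy_diverse_set`, `top_k_by_mean`, and `REWARD_SAMPLES` are hypothetical and chosen only for this example.

```python
from typing import List, Sequence

# Hypothetical toy data: each row is one sampled reward function,
# giving a reward for each of 4 actions. Rows are equally weighted.
REWARD_SAMPLES: List[List[float]] = [
    [1.0, 0.9, 0.0, 0.0],
    [1.0, 0.9, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.2],
]


def expected_reward(samples: Sequence[Sequence[float]], action: int) -> float:
    """Mean reward of a single action across the sampled reward functions."""
    return sum(r[action] for r in samples) / len(samples)


def set_value(samples: Sequence[Sequence[float]], actions: Sequence[int]) -> float:
    """Assumed non-linear set objective: expected best reward within the set."""
    if not actions:
        return 0.0
    return sum(max(r[a] for a in actions) for r in samples) / len(samples)


def greedy_diverse_set(samples: Sequence[Sequence[float]], k: int) -> List[int]:
    """Greedily add the action that most improves the set objective."""
    n_actions = len(samples[0])
    chosen: List[int] = []
    for _ in range(min(k, n_actions)):
        remaining = [a for a in range(n_actions) if a not in chosen]
        best = max(remaining, key=lambda a: set_value(samples, chosen + [a]))
        chosen.append(best)
    return chosen


def top_k_by_mean(samples: Sequence[Sequence[float]], k: int) -> List[int]:
    """Baseline with a linear objective: the k actions with highest mean reward."""
    n_actions = len(samples[0])
    ranked = sorted(
        range(n_actions),
        key=lambda a: expected_reward(samples, a),
        reverse=True,
    )
    return ranked[:k]


if __name__ == "__main__":
    print("top-k by mean:", top_k_by_mean(REWARD_SAMPLES, 2))
    print("greedy set:   ", greedy_diverse_set(REWARD_SAMPLES, 2))
```

In this toy setup the first greedy pick is always the action with the highest expected reward, because the set objective for a single action reduces to its mean. Later picks are rewarded only for covering reward hypotheses that the earlier picks handle poorly, which is where the diversity comes from in this sketch.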

Install command
npx skills add https://github.com/hiyenwong/ai_collection --skill reward-uncertainty-diverse-behaviour

Copy and paste this command into Claude Code to install the skill

Stars: 1
Forks: 0
Updated: June 4, 2026 at 02:00