Skip to main content
Run any Skill in Manus
with one click

bi-nac-bilevel-rl-textual-feedback

Bilevel Natural Language Actor-Critic (Bi-NAC) methodology — joint training of a critic to generate reward-improving textual feedback and an actor to exploit it, formulated as a Stackelberg bilevel program for RL with learnable textual feedback.

Overview

Bilevel Natural Language Actor-Critic (Bi-NAC) methodology — joint training of a critic to generate reward-improving textual feedback and an actor to exploit it, formulated as a Stackelberg bilevel program for RL with learnable textual feedback.

Install command
npx skills add https://github.com/hiyenwong/ai_collection --skill bi-nac-bilevel-rl-textual-feedback

Copy and paste this command into Claude Code to install the skill

Stars1
Forks0
UpdatedJune 4, 2026 at 02:00
SKILL.md
readonly