Executive function describes a set of cognitive processes and mental skills that help an individual plan, monitor, and successfully execute their goals. The “executive functions,” as they’re known, ...
SDPG is the main contribution. It extends GRPO with an exact per-token forward KL between the actor (without privileged context) and itself conditioned on privileged context c: ...