7 items with this tag.12/18/20252025-Era “Reward Hacking” Does Not Show that Reward Is the Optimization Targetreinforcement learningspecification gamingAI11/22/2025Output Supervision Can Obfuscate the CoTreinforcement learningmats programAI6/19/2023Mode Collapse in RL May Be Fueled by the Update Equationreinforcement learningAI6/2/2023Think Carefully Before Calling RL Policies “Agents”reinforcement learningAI7/25/2022Reward Is Not the Optimization Targetreinforcement learningshard theoryAI9/13/2019What You See Isn’t Always What You Wantreinforcement learningAI7/5/2018Making a Difference Tempore: Insights From “Reinforcement Learning: An Introduction”reinforcement learningsummaries