The Pond

Tag: reinforcement learning

7 items with this tag.

12/18/2025
2025-Era “Reward Hacking” Does Not Show that Reward Is the Optimization Target
11/22/2025
Output Supervision Can Obfuscate the CoT
6/19/2023
Mode Collapse in RL May Be Fueled by the Update Equation
6/2/2023
Think Carefully Before Calling RL Policies “Agents”
- reinforcement learning
- AI
7/25/2022
Reward Is Not the Optimization Target
9/13/2019
What You See Isn’t Always What You Want
7/5/2018
Making a Difference Tempore: Insights From “Reinforcement Learning: An Introduction”
- reinforcement learning
- summaries