
Reward Mismatches in RL Cause Emergent Misalignment
Don't Worry About the Vase Podcast
00:00
Short-Term Impact on Practical Problems
Zvi argues these insights help practical alignment work and training-time evaluations if applied carefully.
Play episode from 14:09
Transcript


