
Reward Mismatches in RL Cause Emergent Misalignment
Don't Worry About the Vase Podcast
00:00
Intro
Zvi Moshowitz opens the episode and frames the topic of reward mismatches and emergent misalignment in RL.
Play episode from 00:00
Transcript

Zvi Moshowitz opens the episode and frames the topic of reward mismatches and emergent misalignment in RL.