
AI #138 Part 2: Watch Out For Documents
Don't Worry About the Vase Podcast
You get what you train for
Zvi discusses papers on reward hacking and alignment faking, and how optimization pressures produce misaligned behaviors.