
AI #135: OpenAI Shows Us The Money
Don't Worry About the Vase Podcast
Selection Pressures from Post-Training and Alignment Tradeoffs
Examination of how RL fine-tuning and post-training create non-linear behavior changes and unintended alignment consequences.