
AI #135: OpenAI Shows Us The Money
Don't Worry About the Vase Podcast
Selection Pressures from Post-Training and Alignment Tradeoffs
Examination of how RL fine-tuning and post-training create non-linear behavior changes and unintended alignment consequences.
Play episode from 01:27:07
Transcript


