Selection Pressures from Post-Training and Alignment Tradeoffs

Examination of how RL fine-tuning and post-training create non-linear behavior changes and unintended alignment consequences.

Play episode from 01:27:07

