
Is ChatGPT Getting Worse? with James Zou - #645
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
00:00
Unraveling AI Complexity: The Paradoxes of Safety Enhancements
This chapter explores the intricate relationship between AI model performance and safety modifications. It introduces the concept of neuropleiotropy, revealing how safety improvements can unintentionally impact other aspects of AI behavior through specific experimental examples.
Play episode from 11:49
Transcript


