The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) cover image

Is ChatGPT Getting Worse? with James Zou - #645

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

CHAPTER

Unraveling AI Complexity: The Paradoxes of Safety Enhancements

This chapter explores the intricate relationship between AI model performance and safety modifications. It introduces the concept of neuropleiotropy, revealing how safety improvements can unintentionally impact other aspects of AI behavior through specific experimental examples.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner