Future of Life Institute Podcast cover image

Special: Defeating AI Defenses (with Nicholas Carlini and Nathan Labenz)

Future of Life Institute Podcast

00:00

Navigating AI Memory and Privacy

This chapter explores how language models retain and can output sensitive information from their training, including the complexities of data unlearning and factual accuracy. It discusses advanced techniques for manipulating AI knowledge while addressing the significant challenges and implications for privacy and safety.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app