
Special: Defeating AI Defenses (with Nicholas Carlini and Nathan Labenz)
Future of Life Institute Podcast
00:00
Navigating AI Memory and Privacy
This chapter explores how language models retain and can output sensitive information from their training, including the complexities of data unlearning and factual accuracy. It discusses advanced techniques for manipulating AI knowledge while addressing the significant challenges and implications for privacy and safety.
Transcript
Play full episode