
Special: Defeating AI Defenses (with Nicholas Carlini and Nathan Labenz)
Future of Life Institute Podcast
Exploring Misclassification through Embedding Space Manipulation
This chapter covers embeddings and sophisticated attacks that cause classification models to misclassify images by adding only minimal noise. The discussion highlights the importance of visualizing these attacks in high-dimensional embedding space and draws parallels to mathematical proofs as an aid to understanding.
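To make "misclassify with minimal noise" concrete, here is a minimal sketch. It is not taken from the episode. It assumes a toy linear classifier in place of a real image model, and every name in it (`score`, `predict`, `minimal_linf_attack`, the weights `w`, the bias `b`, the four-value "image" `x`) is hypothetical. For a linear model, the smallest per-pixel (L-infinity) change that flips the decision can be computed in closed form; attacks on real networks instead search for such a perturbation iteratively.

```python
# Toy illustration only: a linear scorer stands in for an image classifier.
# All names and numbers here are assumptions made for the example.

def score(w, b, x):
    """Linear decision score; a positive value means class 1."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b

def predict(w, b, x):
    return 1 if score(w, b, x) > 0 else 0

def sign(v):
    return (v > 0) - (v < 0)

def minimal_linf_attack(w, b, x, margin=1e-6):
    """Return (x_adv, eps): the input shifted by at most eps per coordinate,
    with eps just large enough to push the score across the boundary."""
    s = score(w, b, x)
    l1 = sum(abs(wi) for wi in w)
    # Moving every coordinate by eps against the weights changes the score
    # by eps * ||w||_1, so |s| / ||w||_1 reaches the boundary exactly.
    eps = abs(s) / l1 + margin
    direction = -1 if s > 0 else 1
    x_adv = [xi + direction * eps * sign(wi) for xi, wi in zip(x, w)]
    return x_adv, eps

w = [0.5, -1.0, 2.0, 0.25]   # hypothetical weights
b = -0.1                     # hypothetical bias
x = [0.2, 0.1, 0.3, 0.4]     # hypothetical 4-pixel "image"

x_adv, eps = minimal_linf_attack(w, b, x)
print("original class:", predict(w, b, x))
print("adversarial class:", predict(w, b, x_adv))
print("per-pixel change:", eps)
```

The perturbation moves each coordinate by the same small amount in the direction that most reduces the score, which is the geometric picture of stepping across a decision boundary in a high-dimensional space.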