"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis cover image

Dodging Latent Space Detectors: Obfuscated Activation Attacks with Luke, Erik, and Scott.

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

00:00

Defending Against Machine Learning Threats

This chapter examines the complex dynamics of defense and attack strategies in machine learning, focusing on issues like backdoor triggers and data poisoning. It discusses the challenges in mitigating these threats and the effectiveness of various defense mechanisms, while also emphasizing the vulnerabilities of latent space monitors. The conversation highlights ongoing research on improving AI defenses amidst evolving adversarial tactics and concerns over model access and safety.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app