"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis cover image

Dodging Latent Space Detectors: Obfuscated Activation Attacks with Luke, Erik, and Scott.

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

CHAPTER

Defending Against Machine Learning Threats

This chapter examines the complex dynamics of defense and attack strategies in machine learning, focusing on issues like backdoor triggers and data poisoning. It discusses the challenges in mitigating these threats and the effectiveness of various defense mechanisms, while also emphasizing the vulnerabilities of latent space monitors. The conversation highlights ongoing research on improving AI defenses amidst evolving adversarial tactics and concerns over model access and safety.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner