"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis cover image

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

Dodging Latent Space Detectors: Obfuscated Activation Attacks with Luke, Erik, and Scott.

Jan 18, 2025
Luke Bailey and Eric Jenner, both leading experts on AI safety, dive into their research on obfuscated activation attacks. They dissect methods for bypassing latent-based defenses in AI while examining the vulnerabilities these systems face. The conversation highlights complex topics like backdoor attacks, the importance of diverse datasets, and the ongoing challenge of enhancing model robustness. Their work sheds light on the cat-and-mouse game between attackers and defenders, making it clear that the future of AI safety is as intricate as it is essential.
02:10:23

Podcast summary created with Snipd AI

Quick takeaways

  • The research team highlights alarming safety concerns in AI systems due to potential risks like backdoors and data poisoning.
  • The concept of obfuscated activations reveals how attackers can creatively bypass latent space defenses, challenging the effectiveness of current safety measures.

Deep dives

Introduction to Modern Relationships Podcast

The podcast introduces a new show titled 'Modern Relationships,' featuring power couples in tech and relationship experts discussing how ambitious individuals manage their partnerships. The first episode highlights Delian and Nadia Asparuhov's journey from dating to marriage and parenthood, openly sharing the growth and challenges they've faced. Listeners are encouraged to tune in if they are dating or interested in how technology is transforming modern love.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner