Luke Bailey

Lead author of "Obfuscated Activations Bypass Large Language Model Latent-Based Defenses," researching AI safety and obfuscated activation attacks.

Best podcasts with Luke Bailey

Ranked by the Snipd community

Jan 18, 2025 • 2h 7min

Dodging Latent Space Detectors: Obfuscated Activation Attacks with Luke, Erik, and Scott.

Luke Bailey and Eric Jenner, both leading experts on AI safety, dive into their research on obfuscated activation attacks. They dissect methods for bypassing latent-based defenses in AI while examining the vulnerabilities these systems face. The conversation highlights complex topics like backdoor attacks, the importance of diverse datasets, and the ongoing challenge of enhancing model robustness. Their work sheds light on the cat-and-mouse game between attackers and defenders, making it clear that the future of AI safety is as intricate as it is essential.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

App store banner

Play store banner