Luke Bailey
Lead author of "Obfuscated Activations Bypass Large Language Model Latent-Based Defenses," researching AI safety and obfuscated activation attacks.
Best podcasts with Luke Bailey
Ranked by the Snipd community
31 snips
Jan 18, 2025
• 2h 10min
Dodging Latent Space Detectors: Obfuscated Activation Attacks with Luke, Erik, and Scott.
Luke Bailey and Erik Jenner, both leading experts on AI safety, dive into their research on obfuscated activation attacks. They dissect methods for bypassing latent-based defenses in AI while examining the vulnerabilities these systems face. The conversation highlights complex topics like backdoor attacks, the importance of diverse datasets, and the ongoing challenge of enhancing model robustness. Their work sheds light on the cat-and-mouse game between attackers and defenders, making it clear that the future of AI safety is as intricate as it is essential.