In this discussion, Zico Kolter, a professor at Carnegie Mellon University, Andy Zou, a PhD candidate, and Asher Trockman explore the world of universal adversarial attacks on language models. They delve into the motivations behind these attacks and how simple prompt tweaks can disrupt model behavior. Their conversation highlights the potential short-term harms and long-term risks of 'jailbreaking' AI, including implications for training data and the complexities of model responses. They also touch on the future of AI defenses in this evolving landscape.