"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis cover image

Red Teaming o1 Part 1/2– Automated Jailbreaking with Haize Labs' Leonard Tang, Aidan Ewart, and Brian Huang

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

00:00

Exploring Automated Red Teaming Strategies and Evaluation Challenges

This chapter explores the methodologies behind automated red teaming, highlighting the transformation of initial ideas into scalable attack algorithms. The speakers address the importance of manual evaluation and the challenges encountered in assessing potential dangers from language model responses.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app