"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

Red Teaming o1 Part 1/2: Automated Jailbreaking with Haize Labs' Leonard Tang, Aidan Ewart, and Brian Huang


CHAPTER

Exploring Automated Red Teaming Strategies and Evaluation Challenges

This chapter covers the methodology behind automated red teaming, in particular how initial attack ideas are turned into scalable attack algorithms. The speakers also discuss why manual evaluation remains important and how difficult it is to assess whether a language model's response is actually dangerous.
