
Red Teaming o1, Part 2/2: Detecting Deception with Marius Hobbhahn of Apollo Research

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

00:00

Evaluating AI Models: Insights and Challenges

This chapter explores the assessment of a new AI model, focusing on its performance and improvements in reliability and accuracy compared to previous iterations. The discussion highlights the impact of time constraints on testing methodologies and the importance of structured analysis in research evaluations.

