"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis cover image

Red Teaming o1 Part 2/2– Detecting Deception with Marius Hobbhahn of Apollo Research

"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

00:00

Navigating AI Deception and Advancements

This chapter examines the latest developments at Apollo Research, focusing on new hires and advancements in research methodologies, particularly regarding AI deception. It emphasizes the need for reliable evaluation methods to address the complexities and challenges posed by AI models that may engage in deceptive behaviors. The discussion also raises concerns about the increasing autonomy of AI systems and the implications for safety and ethical adherence as their capabilities evolve.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app