Hear This Idea cover image

#76 – Joe Carlsmith on Scheming AI

Hear This Idea

00:00

Exploring Beyond-Episode Goals in AI Training

The chapter delves into the concept of training AI models with beyond-episode goals, discussing how these goals can develop beyond single episodes. It touches on goal generalization, temporal limitations, and the parallels between human and AI goal-setting processes. The conversation emphasizes the importance of understanding, testing, and managing AI motivations for long-term goal pursuit.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app