#76 – Joe Carlsmith on Scheming AI

Hear This Idea

CHAPTER

Exploring Beyond-Episode Goals in AI Training

This chapter discusses how training could give AI models beyond-episode goals, meaning goals that extend past the end of a single training episode. It covers goal generalization, temporal limits on what a model cares about, and parallels between how humans and AI systems form goals. The conversation emphasizes the importance of understanding, testing, and managing AI motivations when models pursue long-term goals.

