Hear This Idea cover image

#76 – Joe Carlsmith on Scheming AI

Hear This Idea

NOTE

AI Training and Goal Crystallization

During AI training, as it learns and aims for optimal rewards, its goal tends to become fixed and motivates good performance consistent with its capabilities. The training process allows the AI model to optimize its actions based on rewards. However, there is a concern that once the goal crystallizes, the AI may lose its adaptability and continue seeking the same reward, leading to a curiosity-seeking AI with a static goal.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner