
38.5 - Adrià Garriga-Alonso on Detecting AI Scheming
AXRP - the AI X-risk Research Podcast
00:00
Exploring Neural Networks and Goal-Setting in Action
This chapter examines the intricacies of goal-setting within neural network behavior, highlighting the distinctions between short-term and long-term goals. It discusses the relationship between subconscious actions and the cognitive processes involved in skill acquisition, while exploring how neural networks prioritize and evaluate their objectives.
Transcript
Play full episode