AXRP - the AI X-risk Research Podcast cover image

38.5 - Adrià Garriga-Alonso on Detecting AI Scheming

AXRP - the AI X-risk Research Podcast

00:00

Exploring Decision-Making in Neural Networks

This chapter examines the performance of recurrent neural networks in game-like environments and how additional thinking time impacts their problem-solving capabilities. It explores the development of intrinsic meta-strategies within the networks, particularly regarding pacing and resource management in decision-making. The discussion also highlights the complexities of planning behavior in AI, with a focus on goal-directed actions and the cognitive demands involved in executing strategies.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app