Generally Intelligent cover image

Episode 17: Andrew Lampinen, DeepMind, on symbolic behavior, mental time travel, and insights from psychology

Generally Intelligent

00:00

Is the R L Agent Learning How to Solve a Task?

In the generalization trials, the r l agent takes longer to complete the task than it does in the trading trialsthing. But because the model is recurrent, now, you can think of that as like having extra time to sort of due some integration of information. And i can't say for sure what's happening, but that's one possibility for what it's doing. Itits sort of like integrating information time and getting more sure of its decision.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app