Towards Data Science cover image

100. Max Jaderberg - Open-ended learning at DeepMind

Towards Data Science

00:00

Measurement of Generalization in a Diverse Environment?

The trickier part in reinforcement learning is like we were talking about before the environment, you know, we don't have the data readily available. We can't just scrape the internet for millions of hours of exland game play rit we have to somehow create a dynamical system of reinforcement learning process that generates that experience which is rich and interesting. And also how we actually find interesting tasks, generate interesting tasks and get agents to train on these. An inlike measurement of generalization, its own interesting sub area here, liket.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app