Measurement of Generalization in a Diverse Environment?

The trickier part in reinforcement learning is like we were talking about before the environment, you know, we don't have the data readily available. We can't just scrape the internet for millions of hours of exland game play rit we have to somehow create a dynamical system of reinforcement learning process that generates that experience which is rich and interesting. And also how we actually find interesting tasks, generate interesting tasks and get agents to train on these. An inlike measurement of generalization, its own interesting sub area here, liket.

Play episode from 19:08

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app