Generally Intelligent cover image

Episode 24: Jack Parker-Holder, DeepMind, on open-endedness, evolving agents and environments, online adaptation, and offline learning

Generally Intelligent

CHAPTER

Data Efficiency in Simulation Is Not That Useful

In most RL like benchmarks or training environments, the data is a lot smaller. We should be saying how good an agent can we get given as much computers we should find in simulation. But what we really know is being data efficient at test time which could be in the real world. So I don't care if it takes billions of samples to then be able to adapt quickly online when you do get that information. The key thing is I just don't adapt straight away.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner