Marc Bellemare: Distributional Reinforcement Learning

The Gradient: Perspectives on AI

Distributional RL

The final controller that we deployed in production was trained on something like 1.3 million flights. The bottom-line performance difference is only about 1%, but, a bit like how Magnus Carlsen will be very close to all of the other very good chess players around him, it's very hard to know if that 1% is worth a lot or not. So my belief is that it is, but the jury is still out on that.
