The Gradient: Perspectives on AI cover image

Marc Bellemare: Distributional Reinforcement Learning

The Gradient: Perspectives on AI

00:00

The Early Thought Process in the Vashas Time Distance

The early thought process was almost accidental but as often the research was maybe also a bit in the design guide that other researchers at DeepMind knew about the Vashas time distance. The post-doc analysis came only later when a third co-author joined Mark Roland and he said here's how you should be doing it. He spent a bit of time and very rapidly had this new framework for these new distances and show that in need there was a contraction mapping that's in the distance. We found those papers and we needed it to write the papers so that we anchored on those early papers in the VashAs time distance.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app