Jeff Clune

TalkRL: The Reinforcement Learning Podcast

The History of Random Exploration

VPT (Video PreTraining) is an algorithm that learns to act by watching video of people performing a task. It was trained on YouTube videos of people playing Minecraft, roughly eight years' worth of footage, and learned to craft a diamond pickaxe. The approach also relies on a small amount of data collected from contractors, which does not cost all that much money.
