TalkRL: The Reinforcement Learning Podcast cover image

Jeff Clune

TalkRL: The Reinforcement Learning Podcast

CHAPTER

The Achilles Heel of Reinforcement Learning

GoExplorer is a very high fidelity exploration algorithm. It's based on the original DQN paper, which launched 10,000 papers in the sense of kicking off the deep reinforcement learning revolution and putting deep mind on the map. One game in which they literally got zero was Monos and Revenge because it's very, very difficult to ever get any reward in that game through random actions.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner