
Jeff Clune
TalkRL: The Reinforcement Learning Podcast
The Achilles Heel of Reinforcement Learning
GoExplorer is a very high fidelity exploration algorithm. It's based on the original DQN paper, which launched 10,000 papers in the sense of kicking off the deep reinforcement learning revolution and putting deep mind on the map. One game in which they literally got zero was Monos and Revenge because it's very, very difficult to ever get any reward in that game through random actions.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.