TalkRL: The Reinforcement Learning Podcast cover image

Jeff Clune

TalkRL: The Reinforcement Learning Podcast

CHAPTER

The Pathologies of Detachment in Reinforcement Learning

Go Explorer is an algorithm that rewards players for getting to new states of the game. It works better than nothing, but it doesn't solve the game and leaves a lot of it on the table. Go Explore uses two different approaches: detachment and derailment. When we solved both of those pathologies, all of a sudden we had an algorithm called Go Explore.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner