
Robert Lange
TalkRL: The Reinforcement Learning Podcast
Using Hindsight Action Replay to Train Value Estimates for Macro Actions
Peter came up with a clever approach called hindsight action replay. Instead of only storing macro actions in the replay buffer, we can also use sequences of primitive actions to construct macro actions. This allows us to train our value estimates for specific macro actions. It's a cool and innovative method that can be further enhanced by using different discounts.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.