Using Hindsight Action Replay to Train Value Estimates for Macro Actions | 1min snip from TalkRL: The Reinforcement Learning Podcast

Robert Lange

TalkRL: The Reinforcement Learning Podcast

NOTE

Using Hindsight Action Replay to Train Value Estimates for Macro Actions

Peter came up with a clever approach called hindsight action replay. Instead of only storing macro actions in the replay buffer, we can also use sequences of primitive actions to construct macro actions. This allows us to train our value estimates for specific macro actions. It's a cool and innovative method that can be further enhanced by using different discounts.

00:00

Transcript

Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.