
Jacob Beck and Risto Vuorio
TalkRL: The Reinforcement Learning Podcast
PPG Methods for Parameterized Policy Gradient
Mammal was kind of the prototypical algorithm in the PPG setting. You can add additional parameters to tune other than just the initialization. There's a whole family thing that build on Mammal, and the interloop is consistent between them. And then I guess the only one we haven't really touched on yet is task inference methods. The idea here is a little more nuanced, but meta learning considers a distribution of MDPs, also known as tasks.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.