TalkRL: The Reinforcement Learning Podcast cover image

Jacob Beck and Risto Vuorio

TalkRL: The Reinforcement Learning Podcast

CHAPTER

PPG Methods for Parameterized Policy Gradient

Mammal was kind of the prototypical algorithm in the PPG setting. You can add additional parameters to tune other than just the initialization. There's a whole family thing that build on Mammal, and the interloop is consistent between them. And then I guess the only one we haven't really touched on yet is task inference methods. The idea here is a little more nuanced, but meta learning considers a distribution of MDPs, also known as tasks.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner