TalkRL: The Reinforcement Learning Podcast cover image

Jacob Beck and Risto Vuorio

TalkRL: The Reinforcement Learning Podcast

00:00

PPG Methods for Parameterized Policy Gradient

Mammal was kind of the prototypical algorithm in the PPG setting. You can add additional parameters to tune other than just the initialization. There's a whole family thing that build on Mammal, and the interloop is consistent between them. And then I guess the only one we haven't really touched on yet is task inference methods. The idea here is a little more nuanced, but meta learning considers a distribution of MDPs, also known as tasks.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app