
Jacob Beck and Risto Vuorio
TalkRL: The Reinforcement Learning Podcast
00:00
Train Agents on Atari
The goal of that setting can be stated as learning a traditional RL algorithm. Then the other axes of that setting is whether we're dealing with a single task or a multi-task setting. This isn't something that is super often discussed in, especially in some parts of the metarolature but the single-task case is still very interesting and it actually does. So you can train agents on Atari where you actually metarolerned the objective function for the policy gradient that's then updating your policy just on that single task.
Transcript
Play full episode