
Jacob Beck and Risto Vuorio
TalkRL: The Reinforcement Learning Podcast
Train Agents on Atari
The goal of that setting can be stated as learning a traditional RL algorithm. Then the other axes of that setting is whether we're dealing with a single task or a multi-task setting. This isn't something that is super often discussed in, especially in some parts of the metarolature but the single-task case is still very interesting and it actually does. So you can train agents on Atari where you actually metarolerned the objective function for the policy gradient that's then updating your policy just on that single task.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.