TalkRL: The Reinforcement Learning Podcast cover image

Jacob Beck and Risto Vuorio

TalkRL: The Reinforcement Learning Podcast

CHAPTER

Train Agents on Atari

The goal of that setting can be stated as learning a traditional RL algorithm. Then the other axes of that setting is whether we're dealing with a single task or a multi-task setting. This isn't something that is super often discussed in, especially in some parts of the metarolature but the single-task case is still very interesting and it actually does. So you can train agents on Atari where you actually metarolerned the objective function for the policy gradient that's then updating your policy just on that single task.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner