
Max Schwarzer
TalkRL: The Reinforcement Learning Podcast
The Evolution of the Atari Environment
BBF is in a very different regime than SPR was. But what's weird about this is that BBF does better than humans on most games. Do you think that part of what we're seeing actually has to do with the nature of the Atari environment? That would be my hunch. I just don't know where to go for it.