Max Schwarzer

TalkRL: The Reinforcement Learning Podcast

The Importance of Using Random Seeds to Estimate Confidence Intervals

With your BBF agent, Bigger, Better, Faster, you got really exciting results on this Atari 100K benchmark, and 100K is a tiny amount of samples, especially compared to the original DQN back in the day. It was way up in the millions. Yep, exactly. And with some agents, like the curiosity agents, going up to billions of samples. So I understand there's about two hours of play on each game, which is comparable to what a human would take. Exactly.
