TalkRL: The Reinforcement Learning Podcast cover image

John Schulman

TalkRL: The Reinforcement Learning Podcast

00:00

What Are the Biggest Highlights in RL?

There's been a lot of work in RL since TRPO and PBO. TD3 and SAC seem like pretty solid value-based methods, he says. offline RL was also notable; we're training against that dataset.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app