The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) cover image

MOReL: Model-Based Offline Reinforcement Learning with Aravind Rajeswaran - #442

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

00:00

Stateful MDPs and Offline RL Insights

This chapter explores the foundational concepts of stateful Markov Decision Processes (MDPs) in the context of offline reinforcement learning. It examines the importance of compact state representations, the explore-exploit trade-off, and the necessity of safety in real-world applications. Additionally, the chapter discusses a novel approach to offline RL that incorporates pessimistic biases in model dynamics, enhancing performance by encouraging agents to operate within known regions.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app