TalkRL: The Reinforcement Learning Podcast cover image

NeurIPS 2024 - Posters and Hallways 1

TalkRL: The Reinforcement Learning Podcast

CHAPTER

Exploring Reinforcement Learning with Hidden Rewards

This chapter delves into exploration strategies within the context of Monitored Markov Decision Processes (MDPs), particularly when rewards are not fully visible. It critiques traditional optimism-based methods and discusses alternative approaches for effectively navigating less observable environments in reinforcement learning applications.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner