Discounted vs. Average Reward Algorithms

This chapter examines the debate between average-reward and discounted-reward algorithms in reinforcement learning, with a focus on the role of the discount factor gamma. It discusses the practical advantages of adapting the discount factor to the learner's confidence, and the importance of separating the average reward from discount scaling for better decision-making. The episode also reflects on personal experiences in a PhD program, highlighting the mentorship that fosters growth in research and collaboration.

Play episode from 37:49
