
 The Theory of Anything
 The Theory of Anything Episode 28: Reinforcement Learning and Q-Learning
 Aug 9, 2021 
 Learn about reinforcement learning and its stunning victory in Alpha Go. Discover the theory of reinforcement learning and how it solves Markov Decision Problems. Explore important concepts such as state transition and reward functions. Find out the origins of dynamic programming and its challenges without knowledge of these functions. Gain insight into key concepts in reinforcement learning and its potential as a general learning algorithm. Lastly, delve into the learning abilities of animals and comparison to humans and machine learning. 
 Chapters 
 Transcript 
 Episode notes 
 1  2  3  4  5  6  7 
 Introduction 
 00:00 • 3min 
 AlphaGo vs. Lisa Doll: Reinforcement Learning in Go 
 02:50 • 26min 
 State Transition and Reward Functions 
 29:17 • 11min 
 Origins of Dynamic Programming 
 40:07 • 8min 
 Challenges of Reinforcement Learning without Knowledge of State Transition and Reward Functions 
 47:53 • 3min 
 Explaining Key Concepts in Reinforcement Learning 
 50:54 • 29min 
 Animal Intelligence and Learning Abilities 
 01:19:40 • 3min 
