#8116

Mentioned in 6 episodes

Reinforcement Learning: An Introduction

Second Edition

Book • 2018

Andrew G. Barto

This second edition of 'Reinforcement Learning: An Introduction' by Richard S. Sutton and Andrew G. Barto provides a clear and simple account of the field's key ideas and algorithms.

The book is significantly expanded and updated, including new topics such as artificial neural networks, the Fourier basis, and expanded treatment of off-policy learning and policy-gradient methods.

It also includes new chapters on the relationships between reinforcement learning and psychology/neuroscience, as well as updated case studies on AlphaGo, AlphaGo Zero, Atari game playing, and IBM Watson's wagering strategy.

The final chapter discusses the future societal impacts of reinforcement learning.

Mentioned by

Mentioned in 6 episodes

Mentioned by

George Hotz

when discussing the importance of accessible ML compute.

Commoditizing the Petaflop — with George Hotz of the tiny corp

Mentioned by

Minqi Jiang

as a source of foundational knowledge in reinforcement learning.

#114 - Secrets of Deep Reinforcement Learning (Minqi Jiang)

Mentioned by

David Silver

when discussing his journey into reinforcement learning.

#86 – David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning

Recommended by

Marcus Hutter

as a good introduction to reinforcement learning, although he notes it simplifies the subject.

#75 – Marcus Hutter: Universal Artificial Intelligence, AIXI, and AGI

Recommended by

Nathan Lambert

as the classic RL textbook, relevant to understanding RLHF.

The state of post-training in 2025

Recommended by

Tom Gilbert

as a fun and important read on reinforcement learning.

AI: Open vs Closed + NeurIPS Reflections

Mentioned by

Abhishek Naik

as the author of the RL book, whose second part includes sections on average reward and why discounting should be deprecated.

Abhishek Naik on Continuing RL & Average Reward

Mentioned as author of a seminal work on reinforcement learning.

Pablo Arredondo and Joel Hron on Reasoning Models, Deep Research, and the Future of Legal AI

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

App store banner

Play store banner