TalkRL: The Reinforcement Learning Podcast

Robin Ranjit Singh Chauhan

TalkRL podcast is All Reinforcement Learning, All the Time.
In-depth interviews with brilliant people at the forefront of RL research and practice.
Guests from places like MILA, OpenAI, MIT, DeepMind, Berkeley, Amii, Oxford, Google Research, Brown, Waymo, Caltech, and Vector Institute.
Hosted by Robin Ranjit Singh Chauhan.

Episodes

Mentioned books

Jan 3, 2026 • 4min

Joseph Modayil of Openmind Research Institute @ RLC 2025

Joseph Modayil is the Founder, President & Research Director of Openmind Research Institute.Featured References Openmind Research Institute The Alberta Plan for AI Research Richard S. Sutton, Michael Bowling, Patrick M. Pilarski Additional References Joseph Modayil on Google Scholar Joseph Modayil Homepage

Nov 10, 2025 • 1h 41min

Danijar Hafner on Dreamer v4

Danijar Hafner was a Research Scientist at Google DeepMind until recently.Featured References Training Agents Inside of Scalable World Models [ blog ] Danijar Hafner, Wilson Yan, Timothy LillicrapOne Step Diffusion via Shortcut ModelsKevin Frans, Danijar Hafner, Sergey Levine, Pieter AbbeelAction and Perception as Divergence Minimization [ blog ] Danijar Hafner, Pedro A. Ortega, Jimmy Ba, Thomas Parr, Karl Friston, Nicolas Heess Additional References Mastering Diverse Domains through World Models [ blog ] DreaverV3l Danijar Hafner, Jurgis Pasukonis, Jimmy Ba, Timothy Lillicrap Mastering Atari with Discrete World Models [ blog ] DreaverV2 ; Danijar Hafner, Timothy Lillicrap, Mohammad Norouzi, Jimmy Ba Dream to Control: Learning Behaviors by Latent Imagination [ blog ] Dreamer ; Danijar Hafner, Timothy Lillicrap, Jimmy Ba, Mohammad Norouzi Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos [ Blog Post ], Baker et al

Sep 8, 2025 • 60min

David Abel on the Science of Agency @ RLDM 2025

David Abel, a Senior Research Scientist at DeepMind specializing in agency, dives into the fascinating intersection of reinforcement learning and philosophy. He discusses how clear definitions can drive advancements in the field and the relationship between plasticity and empowerment in intelligent agents. The conversation also covers the significance of mutual and directed information in communication theory and challenges traditional views on continual reinforcement learning, advocating for a more dynamic understanding of agency in AI.

Aug 19, 2025 • 12min

Jake Beck, Alex Goldie, & Cornelius Braun on Sutton's OaK, Metalearning, LLMs, Squirrels @ RLC 2025

Recorded at Reinforcement Learning Conference 2025 at University of Alberta, Edmonton Alberta Canada.Featured ReferencesLecture on the Oak Architecture, Rich SuttonAlberta Plan, Rich Sutton with Mike Bowling and Patrick Pilarski Additional ReferencesJacob Beck on Google Scholar Alex Goldie on Google ScholarCornelius Braun on Google ScholarReinforcement Learning Conference

Aug 18, 2025 • 14min

Outstanding Paper Award Winners - 2/2 @ RLC 2025

We caught up with the RLC Outstanding Paper award winners for your listening pleasure. Recorded on location at Reinforcement Learning Conference 2025, at University of Alberta, in Edmonton Alberta Canada in August 2025.Featured References Empirical Reinforcement Learning ResearchMitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functionsAyush Jain, Norio Kosaka, Xinhu Li, Kyung-Min Kim, Erdem Biyik, Joseph J LimApplications of Reinforcement LearningWOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management StrategiesWilliam Solow, Sandhya Saisubramanian, Alan FernEmerging Topics in Reinforcement LearningTowards Improving Reward Design in RL: A Reward Alignment Metric for RL PractitionersCalarina Muslimani, Kerrick Johnstonbaugh, Suyog Chandramouli, Serena Booth, W. Bradley Knox, Matthew E. TaylorScientific Understanding in Reinforcement LearningMulti-Task Reinforcement Learning Enables Parameter ScalingReginald McLean, Evangelos Chatzaroulas, J K Terry, Isaac Woungang, Nariman Farsad, Pablo Samuel Castro

Aug 15, 2025 • 7min

Outstanding Paper Award Winners - 1/2 @ RLC 2025

Explore groundbreaking advancements in reinforcement learning as award-winning researchers share insights from RLC 2025. Discover innovative meta-learning techniques for algorithm discovery and the introduction of a user-friendly curriculum learning library. Dive into the power of PufferLib for ultra-efficient training in diverse applications, alongside discussions on cutting-edge algorithms like prioritized level replay in Minecraft. Lastly, engage with theories that improve convergence and performance in deep reinforcement learning.

Aug 4, 2025 • 52min

Thomas Akam on Model-based RL in the Brain

Thomas Akam, a prominent neuroscientist from Oxford, leads the Cognitive Circuits Research Group and explores the fascinating interface of the brain and behavior. He discusses how the brain adapts actions through internal models, revealing insights into decision-making mechanisms. The conversation also highlights the metabolic costs of brain tissue, contrasting biological intelligence with AI. Additionally, Akam shares advancements in brain measurement technologies, paving the way for exciting strides in understanding cognitive processes.

Jul 22, 2025 • 32min

Stefano Albrecht on Multi-Agent RL @ RLDM 2025

Stefano V. Albrecht was previously Associate Professor at the University of Edinburgh, and is currently serving as Director of AI at startup Deepflow. He is a Program Chair of RLDM 2025 and is co-author of the MIT Press textbook "Multi-Agent Reinforcement Learning: Foundations and Modern Approaches".Featured ReferencesMulti-Agent Reinforcement Learning: Foundations and Modern ApproachesStefano V. Albrecht, Filippos Christianos, Lukas SchäferMIT Press, 2024RLDM 2025: Reinforcement Learning and Decision Making ConferenceDublin, IrelandEPyMARL: Extended Python MARL frameworkhttps://github.com/uoe-agents/epymarlBenchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative TasksGeorgios Papoudakis and Filippos Christianos and Lukas Schäfer and Stefano V. Albrecht

Jun 25, 2025 • 6min

Satinder Singh: The Origin Story of RLDM @ RLDM 2025

Professor Satinder Singh of Google DeepMind and U of Michigan is co-founder of RLDM. Here he narrates the origin story of the Reinforcement Learning and Decision Making meeting (not conference).Recorded on location at Trinity College Dublin, Ireland during RLDM 2025.Featured ReferencesRLDM 2025: Multi-disciplinary Conference on Reinforcement Learning and Decision Making (RLDM)June 11-14, 2025 at Trinity College Dublin, IrelandSatinder Singh on Google Scholar

Mar 9, 2025 • 10min

NeurIPS 2024 - Posters and Hallways 3

Discover innovative benchmarks for multi-agent reinforcement learning in wind farm control, tackling turbine performance issues. Learn about groundbreaking methods that bridge simulation and real-world applications, enhancing exploration strategies. Delve into contextual bi-level reinforcement learning, using leader-follower dynamics for optimizing rewards. Also, explore the QGEN framework, which revolutionizes queuing network simulations with deep learning, setting new standards in action space definition.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

App store banner

Play store banner