TalkRL: The Reinforcement Learning Podcast

75 episodes

Joseph Modayil of Openmind Research Institute @ RLC 2025
2026/01/03 | 4 mins.
Joseph Modayil is the Founder, President & Research Director of Openmind Research Institute.
Featured References
Openmind Research Institute
The Alberta Plan for AI Research
Richard S. Sutton, Michael Bowling, Patrick M. Pilarski

Additional References
Joseph Modayil on Google Scholar
Joseph Modayil Homepage
Danijar Hafner on Dreamer v4
2025/11/10 | 1h 40 mins.
Danijar Hafner was a Research Scientist at Google DeepMind until recently.

Featured References
Training Agents Inside of Scalable World Models [ blog ]
Danijar Hafner, Wilson Yan, Timothy Lillicrap
One Step Diffusion via Shortcut Models
Kevin Frans, Danijar Hafner, Sergey Levine, Pieter Abbeel
Action and Perception as Divergence Minimization [ blog ]
Danijar Hafner, Pedro A. Ortega, Jimmy Ba, Thomas Parr, Karl Friston, Nicolas Heess

Additional References
Mastering Diverse Domains through World Models [ blog ] DreaverV3l Danijar Hafner, Jurgis Pasukonis, Jimmy Ba, Timothy Lillicrap
Mastering Atari with Discrete World Models [ blog ] DreaverV2 ; Danijar Hafner, Timothy Lillicrap, Mohammad Norouzi, Jimmy Ba
Dream to Control: Learning Behaviors by Latent Imagination [ blog ] Dreamer ; Danijar Hafner, Timothy Lillicrap, Jimmy Ba, Mohammad Norouzi
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos [ Blog Post ], Baker et al
David Abel on the Science of Agency @ RLDM 2025
2025/09/08 | 59 mins.
David Abel is a Senior Research Scientist at DeepMind on the Agency team, and an Honorary Fellow at the University of Edinburgh. His research blends computer science and philosophy, exploring foundational questions about reinforcement learning, definitions, and the nature of agency.

Featured References

Plasticity as the Mirror of Empowerment
David Abel, Michael Bowling, André Barreto, Will Dabney, Shi Dong, Steven Hansen, Anna Harutyunyan, Khimya Khetarpal, Clare Lyle, Razvan Pascanu, Georgios Piliouras, Doina Precup, Jonathan Richens, Mark Rowland, Tom Schaul, Satinder Singh

A Definition of Continual RL
David Abel, André Barreto, Benjamin Van Roy, Doina Precup, Hado van Hasselt, Satinder Singh

Agency is Frame-Dependent
David Abel, André Barreto, Michael Bowling, Will Dabney, Shi Dong, Steven Hansen, Anna Harutyunyan, Khimya Khetarpal, Clare Lyle, Razvan Pascanu, Georgios Piliouras, Doina Precup, Jonathan Richens, Mark Rowland, Tom Schaul, Satinder Singh

On the Expressivity of Markov Reward
David Abel, Will Dabney, Anna Harutyunyan, Mark Ho, Michael Littman, Doina Precup, Satinder Singh — Outstanding Paper Award, NeurIPS 2021

Additional References
Bidirectional Communication Theory — Marko 1973
Causality, Feedback and Directed Information — Massey 1990
The Big World Hypothesis — Javed et al. 2024
Loss of plasticity in deep continual learning — Dohare et al. 2024
Three Dogmas of Reinforcement Learning — Abel 2024
Explaining dopamine through prediction errors and beyond — Gershman et al. 2024
David Abel Google Scholar
David Abel personal website
Jake Beck, Alex Goldie, & Cornelius Braun on Sutton's OaK, Metalearning, LLMs, Squirrels @ RLC 2025
2025/08/19 | 12 mins.
Recorded at Reinforcement Learning Conference 2025 at University of Alberta, Edmonton Alberta Canada.
Featured References

Lecture on the Oak Architecture, Rich Sutton
Alberta Plan, Rich Sutton with Mike Bowling and Patrick Pilarski

Additional References
Jacob Beck on Google Scholar
Alex Goldie on Google Scholar
Cornelius Braun on Google Scholar
Reinforcement Learning Conference
Outstanding Paper Award Winners - 2/2 @ RLC 2025
2025/08/18 | 14 mins.
We caught up with the RLC Outstanding Paper award winners for your listening pleasure.
Recorded on location at Reinforcement Learning Conference 2025, at University of Alberta, in Edmonton Alberta Canada in August 2025.
Featured References

Empirical Reinforcement Learning Research
Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions
Ayush Jain, Norio Kosaka, Xinhu Li, Kyung-Min Kim, Erdem Biyik, Joseph J Lim
Applications of Reinforcement Learning
WOFOSTGym: A Crop Simulator for Learning Annual and Perennial Crop Management Strategies
William Solow, Sandhya Saisubramanian, Alan Fern
Emerging Topics in Reinforcement Learning
Towards Improving Reward Design in RL: A Reward Alignment Metric for RL Practitioners
Calarina Muslimani, Kerrick Johnstonbaugh, Suyog Chandramouli, Serena Booth, W. Bradley Knox, Matthew E. Taylor
Scientific Understanding in Reinforcement Learning
Multi-Task Reinforcement Learning Enables Parameter Scaling
Reginald McLean, Evangelos Chatzaroulas, J K Terry, Isaac Woungang, Nariman Farsad, Pablo Samuel Castro

More Technology podcasts

Trending Technology podcasts

About TalkRL: The Reinforcement Learning Podcast

TalkRL podcast is All Reinforcement Learning, All the Time. In-depth interviews with brilliant people at the forefront of RL research and practice. Guests from places like MILA, OpenAI, MIT, DeepMind, Berkeley, Amii, Oxford, Google Research, Brown, Waymo, Caltech, and Vector Institute. Hosted by Robin Ranjit Singh Chauhan.

Podcast website

Technology

Listen to TalkRL: The Reinforcement Learning Podcast, All-In with Chamath, Jason, Sacks & Friedberg and many other podcasts from around the world with the radio.net app

Get the free radio.net app

Stations and podcasts to bookmark
Stream via Wi-Fi or Bluetooth
Supports Carplay & Android Auto
Many other app features

Open app

Get the free radio.net app

Stations and podcasts to bookmark
Stream via Wi-Fi or Bluetooth
Supports Carplay & Android Auto
Many other app features

TalkRL: The Reinforcement Learning Podcast

Scan code,
download the app,
start listening.