inControl cover image

ep5 - Sean Meyn: Markov chains, networks, reinforcement learning, beekeeping and jazz

inControl

00:00

The Borkel Mind Theorem in ML Optimization Control

The Borkel mind theorem is the interpretation of TD1 learning as an optimal projection. The way that flows into what's called actor critic methods in Conda's thesis is breathtaking. Peter Glenn has something to do with that too. I don't know if it's practical or not. It doesn't matter. It's just an excellent, excellent, elegant theory.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app