
ep5 - Sean Meyn: Markov chains, networks, reinforcement learning, beekeeping and jazz
inControl
The Borkar-Meyn Theorem in ML Optimization Control
The Borkar-Meyn theorem is the interpretation of TD(1) learning as an optimal projection. The way that flows into what's called actor-critic methods in Konda's thesis is breathtaking. Peter Glynn has something to do with that too. I don't know if it's practical or not. It doesn't matter. It's just an excellent, excellent, elegant theory.