Reinforcement Learning and the Future of AI

In the future, language models are everywhere. We're generating tons of data in those interactions, and that data will be used to train more AI systems. Reinforcement learning gives me time horizons. I can just play a thousand times until the end of the episode. I see how my actions change things, and how those changes impact my future actions. So when people ask what multi-agent learning is good for, well, it turns out that if we look under the hood, every deployed machine learning system becomes a multi-agent learning problem, because these language models exist in the same environment as humans.

