Model Free Versus Model Base Reenforcement Learning

Most of the reinforcement learning that we use at a large scol is moke free. For example, p po and deep cur learning are all molo free. The main reasons why people are not using modobas alism is that to learn the mode of the word is very hard. You need to first learn to basely do computer vision, and then you need to do a whole separate sort of dynamical prediction.

Play episode from 08:56

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app