Towards Data Science cover image

106. Yang Gao - Sample-efficient AI

Towards Data Science

00:00

Model Free Versus Model Base Reenforcement Learning

Most of the reinforcement learning that we use at a large scol is moke free. For example, p po and deep cur learning are all molo free. The main reasons why people are not using modobas alism is that to learn the mode of the word is very hard. You need to first learn to basely do computer vision, and then you need to do a whole separate sort of dynamical prediction.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app