Towards Data Science cover image

106. Yang Gao - Sample-efficient AI

Towards Data Science

CHAPTER

Model Free Versus Model Base Reenforcement Learning

Most of the reinforcement learning that we use at a large scol is moke free. For example, p po and deep cur learning are all molo free. The main reasons why people are not using modobas alism is that to learn the mode of the word is very hard. You need to first learn to basely do computer vision, and then you need to do a whole separate sort of dynamical prediction.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner