2min chapter

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) cover image

Deep Learning, Transformers, and the Consequences of Scale with Oriol Vinyals - #546

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

CHAPTER

The Importance of Imitating Actions in Language Modelling

Imitation is one part byt so imitating the actions very well is what language modelling regards with, and it's very important. But there is also the reward that we observe in game, like stater, like who won the game. And you can also predict the winner right off line. One of the best performan agents we showed uses mu zero, which is of line. We never use self play, but mu sero basically tries to model based on imulating actions that you may take in the future,. Then pick those that maximis reward or value, but according to your own estimate. There's a lot of tool box, actually, components to be discovered, perhaps

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode