Towards Data Science cover image

120. Liam Fedus and Barrett Zoph - AI scaling with mixture of expert models

Towards Data Science

00:00

Scaling to a Significant Ease

The goal of scaling toa significant extent is generalization, right? That's really the the holy grail here. I've seen, or i've noted in some of your work, that the historically, it seems like mixture of experts model, but models have struggled a little bit more with generalization. Why? Why would it be harder for m oes to generalize than for dense nets? Ya, yes, a great question. And this is something that un ber and i have thought about for at least a couple of years now.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app