[JUNE 2022] Aran Komatsuzaki on Scaling, GPT-J and Alignment

The Inside View

The Importance of Scaling for GPT-like Models

The current state-of-the-art model has like 100 billion parameters and is trained on trillions of tokens, which is very different from how human brains learn. And I think that means it has more capacity than models do. So, yeah, basically I'm trying to make the models closer to how human brains work. Yeah. You kind of want to combine this RL from human feedback procedure from InstructGPT with the pre-training from T0. Yeah. By the way, an encoder-decoder model performs much better than a decoder-only model when it is fine-tuned in a multitask fashion. That's why I'm thinking of this encoder-decoder model.
