The Inside View cover image

[JUNE 2022] Aran Komatsuzaki on Scaling, GPT-J and Alignment

The Inside View

00:00

How JPTJ Compares to JPT3 and JPTNEO

I read the blog post wrote about JPTJ. And so I kind of read about like all those tricks you did. And you talk a lot about throughput. So yeah, I'm curious, like what's the, yeah, throughput for people were not like scaling models all the time. And yeah, how does the compare to like JPT3 or JPTNEO in terms of performance? Is it more efficient, less efficient, more uptake, less overflow?

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app