[JUNE 2022] Aran Komatsuzaki on Scaling, GPT-J and Alignment

The Inside View

One Epoch Is All You Need

I was just trying out top-k sampling and temperature sampling on a small transformer language model in 2018, and, yeah, the results were so much better. But there's a project I did in 2019 about scaling. Yeah, it's called "One Epoch Is All You Need."
