The Inside View cover image

[JUNE 2022] Aran Komatsuzaki on Scaling, GPT-J and Alignment

The Inside View

00:00

How to Fine-Tune a TensorFlow Model

We spent five weeks using 256 cores of TPU V3. And then at the end, you published this model on GitHub. People are very excited and start to use it to fine-tune to a bunch of different cases. It appears that JPTJ is more easier to deal with than other models like JPT Neo. Have you seen this recent YouTube video? And if so, what do you think about it?

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app