Gradient Dissent: Conversations on AI cover image

Peter & Boris — Fine-tuning OpenAI's GPT-3

Gradient Dissent: Conversations on AI

00:00

GPT 3

The big difference between depity two and dpty was really, it was trained on more data. It was a bigger model, diked by two orders of minitude. The surprising thing ist that sort of wad it took to can em go from feeling lycanav dumb to interact with,. But it also felt canno incredibly stupid most of the time. And i think with debiity it went to being like, you know, sometimes just surprisingly good.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app