Gradient Dissent: Conversations on AI cover image

Peter & Boris — Fine-tuning OpenAI's GPT-3

Gradient Dissent: Conversations on AI

00:00

Train Interference

In principle you can set it out to be quite reproducible. As long as you can keep those random stations consistent between your train nd runs, you're essentially going to get the same model at the end of it is going to be fairly reproducible. But there's enough like noise and a little weirdness with like floating point arithmetic and song an these te vews, with these really big models, thata. It's very hard to conte complete te determinism.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app