Practical AI cover image

ChatGPT goes prime time!

Practical AI

00:00

How Chat GPT Was Trained

Chat GPT was trained using a reinforcement learning approach, and other models using this same approach are also pretrained language models./nThe first reward model is trained to take in a prompt and a response and score it like a human would score it according to preference./nThe second reward model is trained to take in a prompt and a response and output a prediction of what a human preference might be on this output.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app