Programming Throwdown cover image

153: ChatGPT

Programming Throwdown

00:00

How Do Human Raiders Can Improve Your GPT Results?

Dolly, you have the query text and an image generation. And that GPT, the thing they did there that is really interesting is this idea called reinforcement learning with human factors or RLHF. All of these models are variational, which means that in addition to passing in the query, you also pass in a random set of numbers. So every time you make a query, you're going to get a different answer for that query. They'll take like five answers for the same query from GPT three. They give it to a human and they ask the human to rank the five answers based on their own personal preference.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app