How Do Human Raiders Can Improve Your GPT Results?

Dolly, you have the query text and an image generation. And that GPT, the thing they did there that is really interesting is this idea called reinforcement learning with human factors or RLHF. All of these models are variational, which means that in addition to passing in the query, you also pass in a random set of numbers. So every time you make a query, you're going to get a different answer for that query. They'll take like five answers for the same query from GPT three. They give it to a human and they ask the human to rank the five answers based on their own personal preference.

Play episode from 01:03:53

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app