What Does Quality Mean?

The paper uses an ensomble of networks, each one generated by taking a pre trained model. They replace the final layer with a ranimly initialized linear layer and fine tuning the whole model based on human preference comparisons. I think maybe the upshot was ansamling does work, but it's expensive because you have to have a bunch of copies your model. And in some sense, i think it's like, sort of ok, or, like, its sort f doesn't matter exactly what metric o use here for this point of our research, as long as we are being consistent with it.

Play episode from 34:31

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app