AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
What Does Quality Mean?
The paper uses an ensomble of networks, each one generated by taking a pre trained model. They replace the final layer with a ranimly initialized linear layer and fine tuning the whole model based on human preference comparisons. I think maybe the upshot was ansamling does work, but it's expensive because you have to have a bunch of copies your model. And in some sense, i think it's like, sort of ok, or, like, its sort f doesn't matter exactly what metric o use here for this point of our research, as long as we are being consistent with it.