AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
How to Evaluate Model Convergence Without Human Judgment
There are many problems in generating, weare generating things. There's no test set ri you have to read the text and ask, ok, does it look human or not? And that sort of word. Another would be, let's say you're doing search relevants. I'm trying to predict what somebody wants to click on. It's very expensive to run an a b test. You can like make a mod like an s b m model, to predict what people will click on but you don't really know how it's going to perform an se yit put in production.