The Future of Evaluating Language Models
The core set of eval modules that we have within LlamaIndex are actually ground-truth-free, or label-free. This is still something that people are exploring these days. Even in the space of generative AI and LLMs, you often have ground-truth text, and then you want some way of scoring how close the predicted text is to that ground-truth text. And it's interesting because it makes use of LLM-based evaluation, which is kind of an interesting way to think about it: basically using the language model to evaluate itself, right? I'm sure there are downsides, which we can get into, but a lot of people are doing it these days too.
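To make the label-free idea concrete, here is a minimal sketch of what such a check might look like. It is not the library's actual API: `is_supported`, `PROMPT_TEMPLATE`, and `stub_judge` are hypothetical names chosen for illustration, and the judge is assumed to be any callable that takes a prompt string and returns the model's text reply. The point is only that the check needs the generated answer and the retrieved context, with no reference answer.

```python
from typing import Callable

# Hypothetical prompt; a real evaluator would likely use a more careful rubric.
PROMPT_TEMPLATE = (
    "Context:\n{context}\n\n"
    "Answer:\n{answer}\n\n"
    "Is the answer fully supported by the context? Reply YES or NO."
)


def is_supported(answer: str, context: str, judge: Callable[[str], str]) -> bool:
    """Label-free check: ask a judge model whether the answer is grounded
    in the context. No ground-truth answer is required."""
    prompt = PROMPT_TEMPLATE.format(context=context, answer=answer)
    verdict = judge(prompt)
    # Treat any reply that starts with YES (case-insensitive) as a pass.
    return verdict.strip().upper().startswith("YES")


def stub_judge(prompt: str) -> str:
    # Stand-in for a real LLM call so the sketch runs offline (assumption):
    # it says YES only when the word "Paris" appears somewhere in the prompt.
    return "YES" if "Paris" in prompt else "NO"


if __name__ == "__main__":
    print(is_supported("The capital is Paris.",
                       "Paris is the capital of France.",
                       stub_judge))
```

In practice `stub_judge` would be swapped for a call to whichever model is doing the judging. That is also where the downsides mentioned above come in, since the judge can share the blind spots of the model being evaluated.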