AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
The Importance of Open Source for Interpretability
This is an important problem, as we've mentioned, of close nation and this appears to be a pretty good breakthrough on that. They found that using their technique, certain kinds of falsehoods can get weeded out of the system more easily than others. It's kind of interesting to see that like the model seems to understand truthfulness as a concept for some fields, but then for other fields, it seems to be maybe a little less good at internally recognizing the truth value of these statements. And one last thought here is they do build on top of llama.