The Effect of Prompts on Sentiments

The hard examples in NLI and the stress test datasets can you talk about that piece yeah. We also evaluated the models on those I think they are very high variance so it's sometimes not easy to say to draw real conclusions about what's going on there. What we found is that if you train on this data you also get better at these phenomena measured by the stress test butYeah as I said I think that should come with the caveat where I don't know what they measure anyway okay are there any other trends I mean you mentioned the audenoscent work and I know that you have a couple of other adversarial liquid datasets as well in general whether any trends interesting trends in any of those

Play episode from 16:51

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app