NLP Highlights cover image

128 - Dynamic Benchmarking, with Douwe Kiela

NLP Highlights

00:00

The Effect of Prompts on Sentiments

The hard examples in NLI and the stress test datasets can you talk about that piece yeah. We also evaluated the models on those I think they are very high variance so it's sometimes not easy to say to draw real conclusions about what's going on there. What we found is that if you train on this data you also get better at these phenomena measured by the stress test butYeah as I said I think that should come with the caveat where I don't know what they measure anyway okay are there any other trends I mean you mentioned the audenoscent work and I know that you have a couple of other adversarial liquid datasets as well in general whether any trends interesting trends in any of those

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app