128 - Dynamic Benchmarking, with Douwe Kiela

NLP Highlights

Building a Question Answering Model

When you're building a question answering model, it should be able to answer the questions that real people would ask if you deployed the model in production. I think it's very doable to find natural-looking examples, completely normal questions, that the model still gets wrong. We want the average case, which is what we're measuring, especially with these huge datasets. But that's also very narrow: SQuAD is only Wikipedia context, a single domain, and stuff like that, right? So we need to go beyond that over time, and we need to be more robust to the worst case as well.
