NLP Highlights cover image

128 - Dynamic Benchmarking, with Douwe Kiela

NLP Highlights

CHAPTER

How to Scale a Dynamic Task Platform

We have a robustness and an if fairness metric but i don't think that there are really well-established ways in the field to measure those yet. We take a particular approach where we have perturbations and we look at whether a fairness perturbation affects your ultimate prediction. One of our real hopes is for the field to come up with better metrics so that we can incorporate them because we're dynamic we can. It be easy to add the new performance measures to that about as well is that okay?

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner