
128 - Dynamic Benchmarking, with Douwe Kiela
NLP Highlights
The Importance of Keeping Models in the Loop
The exact task setup on the dinebench platform is actually determined by the task owners. The decision to keep things fixed right now and not allow people to edit passages or context that's up to the task owners so this would be quite easy to do from a web interface. I understand your motivation in using the stronger model in subsequent rounds but is that the requirement do you think at some point at some point you will not be able to find adversarial examples if you kept using the same model which is just getting trained on your data? Yeah that's a great question. So yeah I hope that as a field we keep thinking about these issues because there are clear benefits to the approach but there are
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.