
67 - GLUE: A Multi-Task Benchmark and Analysis Platform, with Sam Bowman
NLP Highlights
GLUE: A Shared Task on Multitask Learning
GLUE is a shared task on multitask learning with nine target tasks. The goal is to build a model that does well in aggregate across all the tasks. We essentially just measure performance on each of those tasks, take an average, and that's your score. And we have some software tools you can download to make evaluation a little bit easier.
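The scoring rule described here (measure each task, then average) can be sketched in a few lines. This is a minimal illustration, not the benchmark's own evaluation tooling: the function name `aggregate_score`, the task labels `task_1` through `task_9`, and all the numbers are hypothetical placeholders, and the sketch assumes each task has already been reduced to a single score on a common scale.

```python
from statistics import mean


def aggregate_score(task_scores):
    """Return the unweighted mean of per-task scores.

    task_scores maps a task name to one number for that task.
    Every task counts equally, regardless of its dataset size.
    """
    if not task_scores:
        raise ValueError("need at least one task score")
    return mean(task_scores.values())


# Hypothetical placeholder values, NOT real results from any model.
placeholder_scores = {
    "task_1": 10.0,
    "task_2": 20.0,
    "task_3": 30.0,
    "task_4": 40.0,
    "task_5": 50.0,
    "task_6": 60.0,
    "task_7": 70.0,
    "task_8": 80.0,
    "task_9": 90.0,
}

overall = aggregate_score(placeholder_scores)
print(f"aggregate score over {len(placeholder_scores)} tasks: {overall:.1f}")
```

Because the average is unweighted, a model cannot reach a high aggregate score by excelling on one or two tasks alone; a weak result on any single task pulls the overall number down by the same amount.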