

67 - GLUE: A Multi-Task Benchmark and Analysis Platform, with Sam Bowman
Aug 27, 2018
Chapters
Introduction
00:00 • 3min
GLUE: A Shared Task on Multitask Learning
02:34 • 4min
The Future of Language Understanding Tasks
06:48 • 4min
The Diagnostic Data Set for GLUE: A Set of 1,000 Textual Entailment Examples
10:29 • 6min
How to Train on Textual Entailment, Sentiment Analysis, and Semantic Similarity
16:26 • 2min
The Limits of Generalization in Language Understanding
18:00 • 5min
The Future of Reusable Sentence Understanding Tools
22:58 • 3min
GLUE: A Framework for Neural Networks
26:12 • 2min
How to Train a Multi-Task Model
27:59 • 5min
The Importance of Sentence Vectors
32:40 • 2min
Why Leaderboards Encourage Bad Science
34:22 • 3min
The Risk of Cross-Paper Comparisons
37:40 • 2min