67 - GLUE: A Multi-Task Benchmark and Analysis Platform, with Sam Bowman

Aug 27, 2018

Ask episode

Chapters

Transcript

Episode notes

Glue: A Shared Task on Multitask Learning

The Future of Language Understanding Tasks

The Diagnostic Data Set for GLUE: A Set of 1,000 Textual Entailment Examples

How to Train a Textual Entailment for Sentiment Analysis for Semantic Similarity

The Limits of Generalization in Language Understanding

The Future of Reusable Sentence Understanding Tools

Glue: A Framework for Neural Networks

How to Train a Multi-Task Model

The Importance of Sentence Vectors

Why Leaderboards Encourage Bad Science

The Risk of Cross-Paper Comparisons