Why Leaderboards Encourage Bad Science

I feel like things like ELMo or language models inherently scale better with information content than a single vector does. And you're right that a single vector is useful for some applications, say if I want a huge database of sentences that I can do quick lookups over in this vector space. But even then, the particular vector you use is going to be a fixed-length set of extracted features. So however you pre-train it, it's going to be focused on some particular objective, and hopefully that objective matches how you actually want to use it.

