Neural Search Talks — Zeta Alpha cover image

Transformer Memory as a Differentiable Search Index: memorizing thousands of random doc ids works!?

Neural Search Talks — Zeta Alpha

00:00

Indexing a Corpus Using BM25?

The ideal setup is a multi-task setup where they, in some proportion, use both of these. So during training, sometimes it's document to document ID and sometimes it's query to document ID. I think the former's 32 times more common if I were a later figure correctly,. But it's some mixture like this where they more often do the first task. This is the supervised setting. And then the zero-shot setting is just not using the second task. Then they skip that because this requires labeled data, right? It requires labels of what document is relevant. Assume they don't have it.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app