
Task-aware Retrieval with Instructions
Neural Search Talks β Zeta Alpha
00:00
What Is Special About This Training Setup?
The name comes from the instructions. So one difference is this knowledge distillation. And I find distillation to be a bit of a confusing word here because really all it means is you're selecting hard negatives with a different model. Rather than using many LM re-ranker to pick hard negatives, they're using the TART, re-Ranker to pickHardNegatives. Yeah. It should be able to pick even harder negatives than the model that didn't know about the instructions. Right. That has been done. The training setup deserves its own name kind of? Um, yeah.
Transcript
Play full episode