
Benchmarking IR Models (w/ Nandan Thakur)
Neural Search Talks — Zeta Alpha
00:00
Evaluating Information Retrieval Systems with LLMs
This chapter explores the philosophy and methods for assessing information retrieval systems, focusing on the role of large language models as evaluators. It addresses the challenges of human assessors and the need for reliable gold standards, while also discussing future competition opportunities and public leaderboards.
Transcript
Play full episode