
LibreEval: The Largest Open Source Benchmark for RAG Hallucination Detection
Deep Papers
00:00
Creating Large-Scale Dataset for Hallucination Detection in Language Models
This chapter explores the development of a substantial open-source dataset aimed at identifying hallucinations in language models. It details the methodologies employed, including web scraping, data labeling, and distinguishing between synthetic and non-synthetic data.
Transcript
Play full episode