
LibreEval: The Largest Open Source Benchmark for RAG Hallucination Detection
Deep Papers
Creating Large-Scale Dataset for Hallucination Detection in Language Models
This chapter explores the development of a substantial open-source dataset aimed at identifying hallucinations in language models. It details the methodologies employed, including web scraping, data labeling, and distinguishing between synthetic and non-synthetic data.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.