
LibreEval: The Largest Open Source Benchmark for RAG Hallucination Detection

Deep Papers

CHAPTER

Creating a Large-Scale Dataset for Hallucination Detection in Language Models

This chapter covers the development of a large open-source dataset for detecting hallucinations in language models. It describes the methods used, including web scraping, data labeling, and distinguishing between synthetic and non-synthetic data.

