Deep Papers cover image

LibreEval: The Largest Open Source Benchmark for RAG Hallucination Detection

Deep Papers

00:00

Creating Large-Scale Dataset for Hallucination Detection in Language Models

This chapter explores the development of a substantial open-source dataset aimed at identifying hallucinations in language models. It details the methodologies employed, including web scraping, data labeling, and distinguishing between synthetic and non-synthetic data.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app