The Data Stack Show cover image

The Data Stack Show

177: AI-Based Data Cleaning, Data Labelling, and Data Enrichment with LLMs Featuring Rishabh Bhargava of refuel

Feb 14, 2024
Rishabh Bhargava, an expert in AI-based data cleaning, data labelling, and data enrichment with LLMs, discusses topics like the evolution of AI and LLMs, implementing use cases and cost considerations, categorizing search queries, benchmarking and evaluation, utilizing customer support ticket data, understanding confidence scores, and training models with human feedback.
01:07:21

Podcast summary created with Snipd AI

Quick takeaways

  • Refule is a platform for data cleaning, labeling, and enrichment using large language models (LLMs).
  • LLMs have rapidly evolved in recent years and are being implemented for tasks such as internal efficiency gains and improving data workflows.

Deep dives

Overview of Refule and the Founder's Background

Refule is a platform for data cleaning, labeling, and enrichment using large language models (LLMs). The CEO and co-founder, Rish Pragava, has a background in data, machine learning, and AI, with experience at Stanford and working as an ML engineer. Refule aims to make data work more efficient by allowing users to write instructions for LLMs to perform tasks instead of manually working with data.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner