MLOps.community  cover image

Cleanlab: Labeled Datasets that Correct Themselves Automatically // Curtis Northcutt // MLOps Coffee Sessions #105

MLOps.community

00:00

The Journey to Clean and Reliable Data

In this chapter, the hosts discuss their journey in improving data labeling accuracy over the past six years, and introduce Clean Lab as a solution. They explain the concept of positive unlabeled learning, generalize their solution to the full binary case, and share their early research on rank pruning. They also discuss their experience at Facebook AI Research and Amazon, addressing bias in comment rankings and determining false negative rates for Alexa devices.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app