Cleanlab: Labeled Datasets that Correct Themselves Automatically // Curtis Northcutt // MLOps Coffee Sessions #105

MLOps.community

CHAPTER

The Journey to Clean and Reliable Data

In this chapter, the hosts discuss their journey in improving data labeling accuracy over the past six years and introduce Cleanlab as a solution. They explain the concept of positive-unlabeled learning, describe how they generalized their solution to the full binary classification case, and share their early research on rank pruning. They also discuss their experience at Facebook AI Research and Amazon, where they addressed bias in comment rankings and estimated false negative rates for Alexa devices.
