AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
The Journey to Clean and Reliable Data
In this chapter, the hosts discuss their journey in improving data labeling accuracy over the past six years, and introduce Clean Lab as a solution. They explain the concept of positive unlabeled learning, generalize their solution to the full binary case, and share their early research on rank pruning. They also discuss their experience at Facebook AI Research and Amazon, addressing bias in comment rankings and determining false negative rates for Alexa devices.