
635: The Perils of Manually Labeling Data for Machine Learning Models
Super Data Science: ML & AI Podcast with Jon Krohn
00:00
Using Probabilistic Labels in Computer Science
In our product, we decided that weak supervision was going to be one of the tools that we use to solve the labeling problem. What we realized is that these labeling functions are really like search functions in a lot of ways. So given some input, you're assessing some logic and you're returning to a false. That same function signature exists in search engines. Is it similar to what you're looking for? It's like a yes or no question. And you can rank it, obviously. You can say how similar it is and that sort of thing. Super cool. All right. We've talked about this idea of weekly supervised learning thoroughly. And I love this idea of probabilistic
Transcript
Play full episode