DataTalks.Club cover image

Dataset Creation and Curation - Christiaan Swart

DataTalks.Club

CHAPTER

Is There a Good Source for Weak Labelling?

topic moter is a good source of weak labels. If you find a sentence that has a drug and a disease entity in it, then that's a good candidate for having this type label. You can also use all of these space languistic features, a imnent te recognition, i don't know, part of speech, ti i setaraly, all o these type of things. And then you also have a layer on top of this that can weigh that for you to make sure that it's youre getting the best rit bank for buck.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner