
114 - Behavioral Testing of NLP Models, with Marco Tulio Ribeiro
NLP Highlights
00:00
The Failure Rate Depends on How Many Items You Have in Your Instantiation
Some of these tests are essentially templets. If i have a list of a hundred sentiment laden words, that's going to gie me a different failure rate. Depends on which words i have. So maybe a moder is really good on very common ones and not very good on distinct ones. Roughly speaking, i think the percentage doesn't matter as much as so much as is it high enough to be troubling.
Transcript
Play full episode