
60 - FEVER: a large-scale dataset for Fact Extraction and VERification, with James Thorne
NLP Highlights
00:00
Annotation Artifacts in Text Generating
The simplest explanation is always often the most right. We had to incorporate that into our guidelines where given the time constraints. And I think it's right to select the earliest occurrence of evidence on a page rather than these really convoluted explanations, which may be difficult for human to understand. This work came out around the time you were building this data set so you probably this probably wasn't much on your mind when you were doing this but any thoughts on how this might have affected your data set?
Transcript
Play full episode