
60 - FEVER: a large-scale dataset for Fact Extraction and VERification, with James Thorne
NLP Highlights
How to Construct a Dataset for FEVER
Annotators were given a sentence sampled at random from the 50,000 most popular Wikipedia pages as of August of last year. The dataset contains 185,000 claims: human-written factoid sentences that may be true or false. For each claim, we annotated sentence-level evidence from other Wikipedia pages, which can be used to support or refute the claim. Annotators were also asked to apply six types of mutation.
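
To make the shape of the data concrete, here is a minimal sketch of how one claim and its sentence-level evidence might be represented. The class names, field names, and label strings are assumptions made for illustration, not the dataset's actual schema, and the example record is invented rather than taken from FEVER.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass(frozen=True)
class EvidenceSentence:
    """One annotated evidence sentence (hypothetical field names)."""
    page: str         # title of the Wikipedia page the sentence comes from
    sentence_id: int  # position of the sentence within that page
    text: str


@dataclass
class Claim:
    """A human-written factoid claim with its sentence-level evidence."""
    text: str
    label: str  # assumed label strings: "supports" or "refutes"
    evidence: List[EvidenceSentence] = field(default_factory=list)


def evidence_pages(claim: Claim) -> List[str]:
    """Return the distinct page titles cited as evidence, sorted by name."""
    return sorted({e.page for e in claim.evidence})


# Invented record, for illustration only (not from the dataset).
example = Claim(
    text="The Eiffel Tower is in Berlin.",
    label="refutes",
    evidence=[
        EvidenceSentence(
            page="Eiffel Tower",
            sentence_id=0,
            text="The Eiffel Tower is a wrought-iron lattice tower in Paris, France.",
        )
    ],
)

print(evidence_pages(example))
```

Keeping evidence as a list of (page, sentence) references rather than free text mirrors the description above: each claim points at specific sentences on other pages that support or refute it.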