NLP Highlights cover image

60 - FEVER: a large-scale dataset for Fact Extraction and VERification, with James Thorne

NLP Highlights

00:00

How to Construct a Data Set for Fever

Annotators were given a sentence at random from 50,000 most popular pages last year in August. The data set is 185,000 claims - human-written factoid sentences which may be true or false. For each claim, we've annotated at a sentence level from other Wikipedia pages. This can be used as evidence to support or refute these claims. There are six mutation types the annotator was asked to do.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app