CONDAQA: A Contrastive Reading Comprehension Dataset for Reasoning about Negation

Ravichander, Abhilasha; Gardner, Matt; Marasović, Ana

Full-text links:

Download:

Current browse context:

cs.CL

< prev | next >

new | recent | 2211

Computer Science > Computation and Language

Title: CONDAQA: A Contrastive Reading Comprehension Dataset for Reasoning about Negation

Authors: Abhilasha Ravichander, Matt Gardner, Ana Marasović

(Submitted on 1 Nov 2022)

Abstract: The full power of human language-based communication cannot be realized without negation. All human languages have some form of negation. Despite this, negation remains a challenging phenomenon for current natural language understanding systems. To facilitate the future development of models that can process negation effectively, we present CONDAQA, the first English reading comprehension dataset which requires reasoning about the implications of negated statements in paragraphs. We collect paragraphs with diverse negation cues, then have crowdworkers ask questions about the implications of the negated statement in the passage. We also have workers make three kinds of edits to the passage -- paraphrasing the negated statement, changing the scope of the negation, and reversing the negation -- resulting in clusters of question-answer pairs that are difficult for models to answer with spurious shortcuts. CONDAQA features 14,182 question-answer pairs with over 200 unique negation cues and is challenging for current state-of-the-art models. The best performing model on CONDAQA (UnifiedQA-v2-3b) achieves only 42% on our consistency metric, well below human performance which is 81%. We release our dataset, along with fully-finetuned, few-shot, and zero-shot evaluations, to facilitate the development of future NLP methods that work on negated language.

Comments:	EMNLP 2022
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2211.00295 [cs.CL]
	(or arXiv:2211.00295v1 [cs.CL] for this version)

Submission history

From: Abhilasha Ravichander [view email]
[v1] Tue, 1 Nov 2022 06:10:26 GMT (13508kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2211.00295

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: CONDAQA: A Contrastive Reading Comprehension Dataset for Reasoning about Negation

Submission history