Commonsense knowledge adversarial dataset that challenges ELECTRA

Lin, Gongqi; Miao, Yuan; Yang, Xiaoyong; Ou, Wenwu; Cui, Lizhen; Guo, Wei; Miao, Chunyan

Full-text links:

Download:

PDF only

Current browse context:

cs.CL

< prev | next >

new | recent | 2010

Change to browse by:

Computer Science > Computation and Language

Title: Commonsense knowledge adversarial dataset that challenges ELECTRA

Authors: Gongqi Lin, Yuan Miao, Xiaoyong Yang, Wenwu Ou, Lizhen Cui, Wei Guo, Chunyan Miao

(Submitted on 25 Oct 2020)

Abstract: Commonsense knowledge is critical in human reading comprehension. While machine comprehension has made significant progress in recent years, the ability in handling commonsense knowledge remains limited. Synonyms are one of the most widely used commonsense knowledge. Constructing adversarial dataset is an important approach to find weak points of machine comprehension models and support the design of solutions. To investigate machine comprehension models' ability in handling the commonsense knowledge, we created a Question and Answer Dataset with common knowledge of Synonyms (QADS). QADS are questions generated based on SQuAD 2.0 by applying commonsense knowledge of synonyms. The synonyms are extracted from WordNet. Words often have multiple meanings and synonyms. We used an enhanced Lesk algorithm to perform word sense disambiguation to identify synonyms for the context. ELECTRA achieves the state-of-art result on the SQuAD 2.0 dataset in 2019. With scale, ELECTRA can achieve similar performance as BERT does. However, QADS shows that ELECTRA has little ability to handle commonsense knowledge of synonyms. In our experiment, ELECTRA-small can achieve 70% accuracy on SQuAD 2.0, but only 20% on QADS. ELECTRA-large did not perform much better. Its accuracy on SQuAD 2.0 is 88% but dropped significantly to 26% on QADS. In our earlier experiments, BERT, although also failed badly on QADS, was not as bad as ELECTRA. The result shows that even top-performing NLP models have little ability to handle commonsense knowledge which is essential in reading comprehension.

Comments:	To appear in ICARCV2020
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2010.13049 [cs.CL]
	(or arXiv:2010.13049v1 [cs.CL] for this version)

Submission history

From: Gongqi Lin [view email]
[v1] Sun, 25 Oct 2020 07:17:45 GMT (337kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2010.13049

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computation and Language

Title: Commonsense knowledge adversarial dataset that challenges ELECTRA

Submission history