We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CL

Change to browse by:

cs

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Computer Science > Computation and Language

Title: Reasoning Chain Based Adversarial Attack for Multi-hop Question Answering

Authors: Jiayu Ding (1), Siyuan Wang (1), Qin Chen (2), Zhongyu Wei (1) ((1) Fudan University, (2) East China Normal University)
Abstract: Recent years have witnessed impressive advances in challenging multi-hop QA tasks. However, these QA models may fail when faced with some disturbance in the input text and their interpretability for conducting multi-hop reasoning remains uncertain. Previous adversarial attack works usually edit the whole question sentence, which has limited effect on testing the entity-based multi-hop inference ability. In this paper, we propose a multi-hop reasoning chain based adversarial attack method. We formulate the multi-hop reasoning chains starting from the query entity to the answer entity in the constructed graph, which allows us to align the question to each reasoning hop and thus attack any hop. We categorize the questions into different reasoning types and adversarially modify part of the question corresponding to the selected reasoning hop to generate the distracting sentence. We test our adversarial scheme on three QA models on HotpotQA dataset. The results demonstrate significant performance reduction on both answer and supporting facts prediction, verifying the effectiveness of our reasoning chain based attack method for multi-hop reasoning models and the vulnerability of them. Our adversarial re-training further improves the performance and robustness of these models.
Comments: 10 pages including reference, 4 figures
Subjects: Computation and Language (cs.CL)
Cite as: arXiv:2112.09658 [cs.CL]
  (or arXiv:2112.09658v1 [cs.CL] for this version)

Submission history

From: Jiayu Ding [view email]
[v1] Fri, 17 Dec 2021 18:03:14 GMT (426kb,D)

Link back to: arXiv, form interface, contact.