Title: Exploring and Improving Robustness of Multi Task Deep Neural Networks via Domain Agnostic Defenses

Abstract: In this paper, we explore the robustness of Multi-Task Deep Neural Networks (MT-DNN) against non-targeted adversarial attacks across Natural Language Understanding (NLU) tasks, as well as some possible ways to defend against them. Liu et al. have shown that the MT-DNN, owing to the regularization effect of training on cross-task data, is more robust than a vanilla BERT model trained on only one task (1.1%-1.5% absolute difference). We further show that although the MT-DNN generalizes better, making it easily transferable across domains and tasks, it can still be compromised: after only 2 attacks (1-character and 2-character), accuracy drops by 42.05% and 32.24% for the SNLI and SciTail tasks, respectively. Finally, we propose a domain-agnostic defense which restores the model's accuracy (36.75% and 25.94%, respectively), as opposed to a general-purpose defense or an off-the-shelf spell checker.
Comments: 10 pages, 3 figures, 3 tables, 24 citations, 11 equations
Subjects: Computation and Language (cs.CL)
MSC classes: 68T35 (Primary)
ACM classes: I.2.7
Cite as: arXiv:2001.05286 [cs.CL]
  (or arXiv:2001.05286v1 [cs.CL] for this version)

Submission history

From: Kashyap Coimbatore Murali
[v1] Sat, 11 Jan 2020 18:05:15 GMT (579kb,D)
