Dual Head Adversarial Training

Jiang, Yujing; Ma, Xingjun; Erfani, Sarah Monazam; Bailey, James

(Submitted on 21 Apr 2021 (v1), last revised 22 Apr 2021 (this version, v2))

Abstract: Deep neural networks (DNNs) are known to be vulnerable to adversarial examples/attacks, raising concerns about their reliability in safety-critical applications. A number of defense methods have been proposed to train robust DNNs that resist adversarial attacks, among which adversarial training has so far demonstrated the most promising results. However, recent studies have shown that there is an inherent tradeoff between accuracy and robustness in adversarially trained DNNs. In this paper, we propose a novel technique, Dual Head Adversarial Training (DH-AT), to further improve the robustness of existing adversarial training methods. Unlike existing improved variants of adversarial training, DH-AT modifies both the architecture of the network and the training strategy to seek more robustness. Specifically, DH-AT first attaches a second network head (or branch) to one intermediate layer of the network, then uses a lightweight convolutional neural network (CNN) to aggregate the outputs of the two heads. The training strategy is also adapted to reflect the relative importance of the two heads. We empirically show, on multiple benchmark datasets, that DH-AT can bring notable robustness improvements to existing adversarial training methods. Compared with TRADES, a state-of-the-art adversarial training method, DH-AT improves robustness by 3.4% against PGD40 and 2.3% against AutoAttack, and also improves clean accuracy by 1.8%.
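
The dual-head layout described in the abstract can be illustrated with a toy forward pass. This is only a minimal sketch under stated assumptions, not the authors' implementation: the class name DualHeadNet, the layer sizes, the use of single linear layers for the trunk and the heads, and the fixed aggregator weights are all hypothetical. The aggregator here is a kernel-size-1 convolution over the two stacked head outputs (a per-class weighted sum), standing in for the lightweight CNN the abstract mentions. The adapted training strategy is not shown.

```python
import random


def init_linear(rng, n_out, n_in):
    """Return (weights, bias) for a small affine layer with random weights."""
    w = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]
    b = [0.0] * n_out
    return w, b


def linear(x, w, b):
    """Apply the affine map w @ x + b to a plain Python vector."""
    return [sum(wi * xi for wi, xi in zip(row, x)) + bi for row, bi in zip(w, b)]


def relu(v):
    return [max(0.0, a) for a in v]


class DualHeadNet:
    """Toy network: shared trunk, two heads branching from it, and an aggregator.

    All sizes and layer choices are illustrative assumptions.
    """

    def __init__(self, n_in=8, n_hidden=6, n_classes=3, seed=0):
        rng = random.Random(seed)
        # Shared layers up to the intermediate branch point.
        self.trunk = init_linear(rng, n_hidden, n_in)
        # Original head and the attached second head.
        self.head1 = init_linear(rng, n_classes, n_hidden)
        self.head2 = init_linear(rng, n_classes, n_hidden)
        # Aggregator stand-in: a 1x1 convolution over two "channels"
        # (the two heads' logits), i.e. one weight per head plus a bias.
        self.agg_w = [0.5, 0.5]
        self.agg_b = 0.0

    def forward(self, x):
        h = relu(linear(x, *self.trunk))   # shared intermediate features
        z1 = linear(h, *self.head1)        # logits from head 1
        z2 = linear(h, *self.head2)        # logits from head 2
        out = [self.agg_w[0] * a + self.agg_w[1] * b + self.agg_b
               for a, b in zip(z1, z2)]    # aggregated logits
        return z1, z2, out


net = DualHeadNet()
z1, z2, out = net.forward([0.1] * 8)
```

With the fixed weights above, the aggregated logits are simply the average of the two heads' logits. In a trained model the aggregator weights would be learned rather than set by hand.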

Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2104.10377 [cs.LG]
	(or arXiv:2104.10377v2 [cs.LG] for this version)

Submission history

From: Yujing Jiang
[v1] Wed, 21 Apr 2021 06:31:33 GMT (369kb,D)
[v2] Thu, 22 Apr 2021 06:01:25 GMT (369kb,D)
