Towards Understanding Fast Adversarial Training

Li, Bai; Wang, Shiqi; Jana, Suman; Carin, Lawrence

Full-text links:

Download:

Computer Science > Machine Learning

Title: Towards Understanding Fast Adversarial Training

Authors: Bai Li, Shiqi Wang, Suman Jana, Lawrence Carin

(Submitted on 4 Jun 2020)

Abstract: Current neural-network-based classifiers are susceptible to adversarial examples. The most empirically successful approach to defending against such adversarial examples is adversarial training, which incorporates a strong self-attack during training to enhance its robustness. This approach, however, is computationally expensive and hence is hard to scale up. A recent work, called fast adversarial training, has shown that it is possible to markedly reduce computation time without sacrificing significant performance. This approach incorporates simple self-attacks, yet it can only run for a limited number of training epochs, resulting in sub-optimal performance. In this paper, we conduct experiments to understand the behavior of fast adversarial training and show the key to its success is the ability to recover from overfitting to weak attacks. We then extend our findings to improve fast adversarial training, demonstrating superior robust accuracy to strong adversarial training, with much-reduced training time.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2006.03089 [cs.LG]
	(or arXiv:2006.03089v1 [cs.LG] for this version)

Submission history

From: Bai Li [view email]
[v1] Thu, 4 Jun 2020 18:19:43 GMT (620kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2006.03089

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Towards Understanding Fast Adversarial Training

Submission history