References & Citations
Computer Science > Machine Learning
Title: Make Some Noise: Reliable and Efficient Single-Step Adversarial Training
(Submitted on 2 Feb 2022 (v1), last revised 17 Oct 2022 (this version, v3))
Abstract: Recently, Wong et al. showed that adversarial training with single-step FGSM leads to a characteristic failure mode named Catastrophic Overfitting (CO), in which a model becomes suddenly vulnerable to multi-step attacks. Experimentally they showed that simply adding a random perturbation prior to FGSM (RS-FGSM) could prevent CO. However, Andriushchenko and Flammarion observed that RS-FGSM still leads to CO for larger perturbations, and proposed a computationally expensive regularizer (GradAlign) to avoid it. In this work, we methodically revisit the role of noise and clipping in single-step adversarial training. Contrary to previous intuitions, we find that using a stronger noise around the clean sample combined with \textit{not clipping} is highly effective in avoiding CO for large perturbation radii. We then propose Noise-FGSM (N-FGSM) that, while providing the benefits of single-step adversarial training, does not suffer from CO. Empirical analyses on a large suite of experiments show that N-FGSM is able to match or surpass the performance of previous state-of-the-art GradAlign, while achieving 3x speed-up. Code can be found in this https URL
Submission history
From: Pau de Jorge Aranda [view email][v1] Wed, 2 Feb 2022 18:10:01 GMT (2250kb,D)
[v2] Mon, 20 Jun 2022 21:27:24 GMT (2166kb,D)
[v3] Mon, 17 Oct 2022 22:10:03 GMT (2137kb,D)
Link back to: arXiv, form interface, contact.