CE-based white-box adversarial attacks will not work using super-fitting

Yang, Youhuan; Sun, Lei; Dai, Leyu; Guo, Song; Mao, Xiuqing; Wang, Xiaoqin; Xu, Bayi

(Submitted on 4 May 2022 (v1), last revised 15 May 2022 (this version, v2))

Abstract: Deep neural networks are widely used across many fields because of their strong performance. However, recent studies have shown that deep learning models are vulnerable to adversarial attacks: adding a slight perturbation to the input can cause the model to produce wrong results. This is especially dangerous for systems with high security requirements, so this paper proposes a new defense method that uses the model's super-fitting state to improve its adversarial robustness (i.e., its accuracy under adversarial attacks). The paper mathematically proves the effectiveness of super-fitting and enables the model to reach this state quickly by minimizing unrelated category scores (MUCS). Theoretically, super-fitting can resist any existing (and even future) CE-based white-box adversarial attack. In addition, the paper evaluates the adversarial robustness of super-fitting with a variety of powerful attack algorithms and compares the proposed method with nearly 50 defense models from recent conferences. The experimental results show that a model trained with the proposed super-fitting method obtains the highest adversarial robustness among the compared models.
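
For readers unfamiliar with the term, the following is a minimal sketch of what a CE-based (cross-entropy-based) white-box attack looks like. It is not taken from the paper. It assumes a toy two-class linear softmax classifier with made-up weights `W` and `b`, and uses a single FGSM-style step (the fast gradient sign method, swapped in here as a simple, well-known example of this attack family). The attacker knows the model, computes the gradient of the cross-entropy loss with respect to the input, and moves each input coordinate by `eps` in the direction that increases the loss. The paper's super-fitting and MUCS procedures are not implemented here.

```python
import math

def softmax(z):
    # Numerically stable softmax over a list of logits.
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]

def logits(W, b, x):
    # Linear model: one logit per class.
    return [sum(w * xi for w, xi in zip(row, x)) + bi for row, bi in zip(W, b)]

def ce_loss(W, b, x, y):
    # Cross-entropy loss of the true class y.
    return -math.log(softmax(logits(W, b, x))[y])

def ce_grad_wrt_input(W, b, x, y):
    # For a linear softmax model, dL/dx = W^T (p - onehot(y)).
    p = softmax(logits(W, b, x))
    d = [pk - (1.0 if k == y else 0.0) for k, pk in enumerate(p)]
    return [sum(d[k] * W[k][j] for k in range(len(W))) for j in range(len(x))]

def sign(v):
    return (v > 0) - (v < 0)

def fgsm(W, b, x, y, eps):
    # One FGSM-style step: move each coordinate by eps along the gradient sign.
    g = ce_grad_wrt_input(W, b, x, y)
    return [xi + eps * sign(gi) for xi, gi in zip(x, g)]

def predict(W, b, x):
    z = logits(W, b, x)
    return max(range(len(z)), key=z.__getitem__)

# Hypothetical toy model and input, chosen only for illustration.
W = [[1.0, -1.0], [-1.0, 1.0]]
b = [0.0, 0.0]
x, y = [0.6, 0.4], 0

x_adv = fgsm(W, b, x, y, eps=0.2)
print("clean prediction:", predict(W, b, x))
print("adversarial prediction:", predict(W, b, x_adv))
```

In this toy setting the perturbation is bounded by `eps` per coordinate, yet it raises the cross-entropy loss enough to change the predicted class. Attacks in this family all depend on that input gradient of the loss, which is the signal the abstract says super-fitting defends against.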

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR)
Cite as:	arXiv:2205.02741 [cs.LG]
	(or arXiv:2205.02741v2 [cs.LG] for this version)

Submission history

From: Youhuan Yang
[v1] Wed, 4 May 2022 09:23:00 GMT (623kb)
[v2] Sun, 15 May 2022 12:17:35 GMT (405kb)
