Exploiting the Sensitivity of $L_2$ Adversarial Examples to Erase-and-Restore

Zuo, Fei; Zeng, Qiang

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 2001

Computer Science > Computer Vision and Pattern Recognition

Title: Exploiting the Sensitivity of $L_2$ Adversarial Examples to Erase-and-Restore

Authors: Fei Zuo, Qiang Zeng

(Submitted on 1 Jan 2020 (v1), last revised 12 Dec 2020 (this version, v2))

Abstract: By adding carefully crafted perturbations to input images, adversarial examples (AEs) can be generated to mislead neural-network-based image classifiers. $L_2$ adversarial perturbations by Carlini and Wagner (CW) are among the most effective but difficult-to-detect attacks. While many countermeasures against AEs have been proposed, detection of adaptive CW-$L_2$ AEs is still an open question. We find that, by randomly erasing some pixels in an $L_2$ AE and then restoring it with an inpainting technique, the AE, before and after the steps, tends to have different classification results, while a benign sample does not show this symptom. We thus propose a novel AE detection technique, Erase-and-Restore (E&R), that exploits the intriguing sensitivity of $L_2$ attacks. Experiments conducted on two popular image datasets, CIFAR-10 and ImageNet, show that the proposed technique is able to detect over 98% of $L_2$ AEs and has a very low false positive rate on benign images. The detection technique exhibits high transferability: a detection system trained using CW-$L_2$ AEs can accurately detect AEs generated using another $L_2$ attack method. More importantly, our approach demonstrates strong resilience to adaptive $L_2$ attacks, filling a critical gap in AE detection. Finally, we interpret the detection technique through both visualization and quantification.

Comments:	Accepted to AsiaCCS'21 on 10/24/2020; 12 pages; the code, datasets, and models will be made publicly available when the paper is presented
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Image and Video Processing (eess.IV)
Cite as:	arXiv:2001.00116 [cs.CV]
	(or arXiv:2001.00116v2 [cs.CV] for this version)

Submission history

From: Qiang Zeng [view email]
[v1] Wed, 1 Jan 2020 00:15:07 GMT (2442kb,D)
[v2] Sat, 12 Dec 2020 23:48:02 GMT (6650kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2001.00116

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Exploiting the Sensitivity of $L_2$ Adversarial Examples to Erase-and-Restore

Submission history