One pixel attack for fooling deep neural networks

Su, Jiawei; Vargas, Danilo Vasconcellos; Kouichi, Sakurai

doi:10.1109/TEVC.2019.2890858

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 1710

Computer Science > Machine Learning

Title: One pixel attack for fooling deep neural networks

Authors: Jiawei Su, Danilo Vasconcellos Vargas, Sakurai Kouichi

(Submitted on 24 Oct 2017 (v1), last revised 17 Oct 2019 (this version, v7))

Abstract: Recent research has revealed that the output of Deep Neural Networks (DNN) can be easily altered by adding relatively small perturbations to the input vector. In this paper, we analyze an attack in an extremely limited scenario where only one pixel can be modified. For that we propose a novel method for generating one-pixel adversarial perturbations based on differential evolution (DE). It requires less adversarial information (a black-box attack) and can fool more types of networks due to the inherent features of DE. The results show that 67.97% of the natural images in Kaggle CIFAR-10 test dataset and 16.04% of the ImageNet (ILSVRC 2012) test images can be perturbed to at least one target class by modifying just one pixel with 74.03% and 22.91% confidence on average. We also show the same vulnerability on the original CIFAR-10 dataset. Thus, the proposed attack explores a different take on adversarial machine learning in an extreme limited scenario, showing that current DNNs are also vulnerable to such low dimension attacks. Besides, we also illustrate an important application of DE (or broadly speaking, evolutionary computation) in the domain of adversarial machine learning: creating tools that can effectively generate low-cost adversarial attacks against neural networks for evaluating robustness.

Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Journal reference:	IEEE Transactions on Evolutionary Computation}, Vol.23 , Issue.5 , pp. 828--841. Publisher: IEEE. 2019
DOI:	10.1109/TEVC.2019.2890858
Cite as:	arXiv:1710.08864 [cs.LG]
	(or arXiv:1710.08864v7 [cs.LG] for this version)

Submission history

From: Jiawei Su [view email]
[v1] Tue, 24 Oct 2017 16:02:19 GMT (815kb,D)
[v2] Thu, 16 Nov 2017 07:58:35 GMT (958kb,D)
[v3] Fri, 16 Feb 2018 08:53:44 GMT (1121kb,D)
[v4] Thu, 22 Feb 2018 09:18:34 GMT (1129kb,D)
[v5] Mon, 28 Jan 2019 04:39:30 GMT (1494kb,D)
[v6] Fri, 3 May 2019 08:32:24 GMT (4475kb,D)
[v7] Thu, 17 Oct 2019 07:46:53 GMT (2233kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:1710.08864

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: One pixel attack for fooling deep neural networks

Submission history