Robust Universal Adversarial Perturbations

Xu, Changming; Singh, Gagandeep

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2206

Computer Science > Machine Learning

Title: Robust Universal Adversarial Perturbations

Authors: Changming Xu, Gagandeep Singh

(Submitted on 22 Jun 2022 (this version), latest version 6 Jun 2023 (v2))

Abstract: Universal Adversarial Perturbations (UAPs) are imperceptible, image-agnostic vectors that cause deep neural networks (DNNs) to misclassify inputs from a data distribution with high probability. Existing methods do not create UAPs robust to transformations, thereby limiting their applicability as a real-world attacks. In this work, we introduce a new concept and formulation of robust universal adversarial perturbations. Based on our formulation, we build a novel, iterative algorithm that leverages probabilistic robustness bounds for generating UAPs robust against transformations generated by composing arbitrary sub-differentiable transformation functions. We perform an extensive evaluation on the popular CIFAR-10 and ILSVRC 2012 datasets measuring robustness under human-interpretable semantic transformations, such as rotation, contrast changes, etc, that are common in the real-world. Our results show that our generated UAPs are significantly more robust than those from baselines.

Comments:	16 pages, 3 figures
Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR)
Cite as:	arXiv:2206.10858 [cs.LG]
	(or arXiv:2206.10858v1 [cs.LG] for this version)

Submission history

From: Changming Xu [view email]
[v1] Wed, 22 Jun 2022 06:05:30 GMT (2579kb,D)
[v2] Tue, 6 Jun 2023 05:16:38 GMT (8559kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2206.10858v1

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Robust Universal Adversarial Perturbations

Submission history