Jacobian Regularization for Mitigating Universal Adversarial Perturbations

Co, Kenneth T.; Rego, David Martinez; Lupu, Emil C.

doi:10.1007/978-3-030-86380-7_17

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2104

Computer Science > Machine Learning

Title: Jacobian Regularization for Mitigating Universal Adversarial Perturbations

Authors: Kenneth T. Co, David Martinez Rego, Emil C. Lupu

(Submitted on 21 Apr 2021 (v1), last revised 13 Sep 2021 (this version, v2))

Abstract: Universal Adversarial Perturbations (UAPs) are input perturbations that can fool a neural network on large sets of data. They are a class of attacks that represents a significant threat as they facilitate realistic, practical, and low-cost attacks on neural networks. In this work, we derive upper bounds for the effectiveness of UAPs based on norms of data-dependent Jacobians. We empirically verify that Jacobian regularization greatly increases model robustness to UAPs by up to four times whilst maintaining clean performance. Our theoretical analysis also allows us to formulate a metric for the strength of shared adversarial perturbations between pairs of inputs. We apply this metric to benchmark datasets and show that it is highly correlated with the actual observed robustness. This suggests that realistic and practical universal attacks can be reliably mitigated without sacrificing clean accuracy, which shows promise for the robustness of machine learning systems.

Comments:	In Proceedings of the 30th International Conference on Artificial Neural Networks (ICANN 2021), related code available at: this https URL
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
DOI:	10.1007/978-3-030-86380-7_17
Cite as:	arXiv:2104.10459 [cs.LG]
	(or arXiv:2104.10459v2 [cs.LG] for this version)

Submission history

From: Kenneth Co [view email]
[v1] Wed, 21 Apr 2021 11:00:21 GMT (318kb)
[v2] Mon, 13 Sep 2021 00:01:57 GMT (317kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2104.10459

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Jacobian Regularization for Mitigating Universal Adversarial Perturbations

Submission history