Adversarial Robustness in Deep Learning: Attacks on Fragile Neurons

Pravin, Chandresh; Martino, Ivan; Nicosia, Giuseppe; Ojha, Varun

doi:10.1007/978-3-030-86362-3_2

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2201

Computer Science > Machine Learning

Title: Adversarial Robustness in Deep Learning: Attacks on Fragile Neurons

Authors: Chandresh Pravin, Ivan Martino, Giuseppe Nicosia, Varun Ojha

(Submitted on 31 Jan 2022)

Abstract: We identify fragile and robust neurons of deep learning architectures using nodal dropouts of the first convolutional layer. Using an adversarial targeting algorithm, we correlate these neurons with the distribution of adversarial attacks on the network. Adversarial robustness of neural networks has gained significant attention in recent times and highlights intrinsic weaknesses of deep learning networks against carefully constructed distortion applied to input images. In this paper, we evaluate the robustness of state-of-the-art image classification models trained on the MNIST and CIFAR10 datasets against the fast gradient sign method attack, a simple yet effective method of deceiving neural networks. Our method identifies the specific neurons of a network that are most affected by the adversarial attack being applied. We, therefore, propose to make fragile neurons more robust against these attacks by compressing features within robust neurons and amplifying the fragile neurons proportionally.

Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR)
Journal reference:	Artificial Neural Networks and Machine Learning ICANN 2021
DOI:	10.1007/978-3-030-86362-3_2
Cite as:	arXiv:2201.12347 [cs.LG]
	(or arXiv:2201.12347v1 [cs.LG] for this version)

Submission history

From: Varun Ojha [view email]
[v1] Mon, 31 Jan 2022 14:34:07 GMT (1013kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2201.12347

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Adversarial Robustness in Deep Learning: Attacks on Fragile Neurons

Submission history