Title: Adversarial Robustness in Deep Learning: Attacks on Fragile Neurons

Abstract: We identify fragile and robust neurons of deep learning architectures using nodal dropouts of the first convolutional layer. Using an adversarial targeting algorithm, we correlate these neurons with the distribution of adversarial attacks on the network. Adversarial robustness of neural networks has recently gained significant attention, as it highlights intrinsic weaknesses of deep learning networks against carefully constructed distortions applied to input images. In this paper, we evaluate the robustness of state-of-the-art image classification models trained on the MNIST and CIFAR10 datasets against the fast gradient sign method (FGSM) attack, a simple yet effective method of deceiving neural networks. Our method identifies the specific neurons of a network that are most affected by the adversarial attack being applied. We therefore propose to make fragile neurons more robust against these attacks by compressing features within robust neurons and amplifying the fragile neurons proportionally.
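
For readers unfamiliar with the fast gradient sign method named in the abstract, the following is a minimal sketch of the attack on a toy logistic classifier. It is not the authors' implementation: the paper evaluates convolutional networks on MNIST and CIFAR10, whereas the model, the weights `w`, the input `x`, and the step size `eps` here are all hypothetical values chosen for illustration. The attack moves each input feature by `eps` in the direction of the sign of the loss gradient with respect to the input, then clips to the valid pixel range.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def predict(w, b, x):
    # Probability of class 1 under a logistic model (stand-in for a network).
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)

def sign(v):
    # Returns -1, 0, or 1.
    return (v > 0) - (v < 0)

def fgsm(w, b, x, y, eps):
    # For binary cross-entropy on a logistic model, the gradient of the
    # loss with respect to the input is (p - y) * w.
    p = predict(w, b, x)
    grad = [(p - y) * wi for wi in w]
    # Step each feature by eps along the gradient sign, then clip to [0, 1].
    return [min(1.0, max(0.0, xi + eps * sign(g))) for xi, g in zip(x, grad)]

# Hypothetical toy model and input (assumptions, not from the paper).
w = [2.0, -3.0, 1.0]
b = 0.0
x = [0.6, 0.2, 0.5]
y = 1  # true label

x_adv = fgsm(w, b, x, y, eps=0.25)
print("clean prediction:      ", predict(w, b, x))
print("adversarial prediction:", predict(w, b, x_adv))
```

In this toy setting, a perturbation bounded by 0.25 per feature is enough to push the predicted probability of the true class below 0.5, which is the kind of small, targeted distortion the abstract refers to.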
Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR)
Journal reference: Artificial Neural Networks and Machine Learning ICANN 2021
DOI: 10.1007/978-3-030-86362-3_2
Cite as: arXiv:2201.12347 [cs.LG]
  (or arXiv:2201.12347v1 [cs.LG] for this version)

Submission history

From: Varun Ojha
[v1] Mon, 31 Jan 2022 14:34:07 GMT (1013kb,D)
