HYDRA: Pruning Adversarially Robust Neural Networks

Sehwag, Vikash; Wang, Shiqi; Mittal, Prateek; Jana, Suman

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 2002

Computer Science > Computer Vision and Pattern Recognition

Title: HYDRA: Pruning Adversarially Robust Neural Networks

Authors: Vikash Sehwag, Shiqi Wang, Prateek Mittal, Suman Jana

(Submitted on 24 Feb 2020 (v1), last revised 10 Nov 2020 (this version, v3))

Abstract: In safety-critical but computationally resource-constrained applications, deep learning faces two key challenges: lack of robustness against adversarial attacks and large neural network size (often millions of parameters). While the research community has extensively explored the use of robust training and network pruning independently to address one of these challenges, only a few recent works have studied them jointly. However, these works inherit a heuristic pruning strategy that was developed for benign training, which performs poorly when integrated with robust training techniques, including adversarial training and verifiable robust training. To overcome this challenge, we propose to make pruning techniques aware of the robust training objective and let the training objective guide the search for which connections to prune. We realize this insight by formulating the pruning objective as an empirical risk minimization problem which is solved efficiently using SGD. We demonstrate that our approach, titled HYDRA, achieves compressed networks with state-of-the-art benign and robust accuracy, simultaneously. We demonstrate the success of our approach across CIFAR-10, SVHN, and ImageNet dataset with four robust training techniques: iterative adversarial training, randomized smoothing, MixTrain, and CROWN-IBP. We also demonstrate the existence of highly robust sub-networks within non-robust networks. Our code and compressed networks are publicly available at \url{this https URL}.

Comments:	NeurIPS 2020
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2002.10509 [cs.CV]
	(or arXiv:2002.10509v3 [cs.CV] for this version)

Submission history

From: Vikash Sehwag [view email]
[v1] Mon, 24 Feb 2020 19:54:53 GMT (10147kb,D)
[v2] Wed, 1 Jul 2020 14:26:57 GMT (8983kb,D)
[v3] Tue, 10 Nov 2020 15:02:00 GMT (9013kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2002.10509

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: HYDRA: Pruning Adversarially Robust Neural Networks

Submission history