Revisiting Ensembles in an Adversarial Context: Improving Natural Accuracy

Saligrama, Aditya; Leclerc, Guillaume

Full-text links:

Download:

Current browse context:

stat.ML

< prev | next >

new | recent | 2002

Statistics > Machine Learning

Title: Revisiting Ensembles in an Adversarial Context: Improving Natural Accuracy

Authors: Aditya Saligrama, Guillaume Leclerc

(Submitted on 26 Feb 2020)

Abstract: A necessary characteristic for the deployment of deep learning models in real world applications is resistance to small adversarial perturbations while maintaining accuracy on non-malicious inputs. While robust training provides models that exhibit better adversarial accuracy than standard models, there is still a significant gap in natural accuracy between robust and non-robust models which we aim to bridge. We consider a number of ensemble methods designed to mitigate this performance difference. Our key insight is that model trained to withstand small attacks, when ensembled, can often withstand significantly larger attacks, and this concept can in turn be leveraged to optimize natural accuracy. We consider two schemes, one that combines predictions from several randomly initialized robust models, and the other that fuses features from robust and standard models.

Comments:	5 pages, accepted to ICLR 2020 Workshop on Towards Trustworthy ML: Rethinking Security and Privacy for ML
Subjects:	Machine Learning (stat.ML); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2002.11572 [stat.ML]
	(or arXiv:2002.11572v1 [stat.ML] for this version)

Submission history

From: Aditya Saligrama [view email]
[v1] Wed, 26 Feb 2020 15:45:58 GMT (156kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:2002.11572

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Machine Learning

Title: Revisiting Ensembles in an Adversarial Context: Improving Natural Accuracy

Submission history