Defense Through Diverse Directions

Bender, Christopher M.; Li, Yang; Shi, Yifeng; Reiter, Michael K.; Oliva, Junier B.

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2003

Computer Science > Machine Learning

Title: Defense Through Diverse Directions

Authors: Christopher M. Bender, Yang Li, Yifeng Shi, Michael K. Reiter, Junier B. Oliva

(Submitted on 24 Mar 2020)

Abstract: In this work we develop a novel Bayesian neural network methodology to achieve strong adversarial robustness without the need for online adversarial training. Unlike previous efforts in this direction, we do not rely solely on the stochasticity of network weights by minimizing the divergence between the learned parameter distribution and a prior. Instead, we additionally require that the model maintain some expected uncertainty with respect to all input covariates. We demonstrate that by encouraging the network to distribute evenly across inputs, the network becomes less susceptible to localized, brittle features which imparts a natural robustness to targeted perturbations. We show empirical robustness on several benchmark datasets.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2003.10602 [cs.LG]
	(or arXiv:2003.10602v1 [cs.LG] for this version)

Submission history

From: Christopher Bender [view email]
[v1] Tue, 24 Mar 2020 01:22:03 GMT (888kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2003.10602

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Defense Through Diverse Directions

Submission history