A Bayes-Optimal View on Adversarial Examples

Richardson, Eitan; Weiss, Yair

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2002

Computer Science > Machine Learning

Title: A Bayes-Optimal View on Adversarial Examples

Authors: Eitan Richardson, Yair Weiss

(Submitted on 20 Feb 2020 (v1), last revised 17 Mar 2021 (this version, v2))

Abstract: Since the discovery of adversarial examples - the ability to fool modern CNN classifiers with tiny perturbations of the input, there has been much discussion whether they are a "bug" that is specific to current neural architectures and training methods or an inevitable "feature" of high dimensional geometry. In this paper, we argue for examining adversarial examples from the perspective of Bayes-Optimal classification. We construct realistic image datasets for which the Bayes-Optimal classifier can be efficiently computed and derive analytic conditions on the distributions under which these classifiers are provably robust against any adversarial attack even in high dimensions. Our results show that even when these "gold standard" optimal classifiers are robust, CNNs trained on the same datasets consistently learn a vulnerable classifier, indicating that adversarial examples are often an avoidable "bug". We further show that RBF SVMs trained on the same data consistently learn a robust classifier. The same trend is observed in experiments with real images in different datasets.

Comments:	Minor revision per journal review, 28 pages
Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:2002.08859 [cs.LG]
	(or arXiv:2002.08859v2 [cs.LG] for this version)

Submission history

From: Eitan Richardson [view email]
[v1] Thu, 20 Feb 2020 16:43:47 GMT (3653kb,D)
[v2] Wed, 17 Mar 2021 09:47:10 GMT (4333kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2002.08859

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: A Bayes-Optimal View on Adversarial Examples

Submission history