Efficient and Robust Classification for Sparse Attacks

Beliaev, Mark; Delgosha, Payam; Hassani, Hamed; Pedarsani, Ramtin

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2201

Computer Science > Machine Learning

Title: Efficient and Robust Classification for Sparse Attacks

Authors: Mark Beliaev, Payam Delgosha, Hamed Hassani, Ramtin Pedarsani

(Submitted on 23 Jan 2022)

Abstract: In the past two decades we have seen the popularity of neural networks increase in conjunction with their classification accuracy. Parallel to this, we have also witnessed how fragile the very same prediction models are: tiny perturbations to the inputs can cause misclassification errors throughout entire datasets. In this paper, we consider perturbations bounded by the $\ell_0$--norm, which have been shown as effective attacks in the domains of image-recognition, natural language processing, and malware-detection. To this end, we propose a novel defense method that consists of "truncation" and "adversarial training". We then theoretically study the Gaussian mixture setting and prove the asymptotic optimality of our proposed classifier. Motivated by the insights we obtain, we extend these components to neural network classifiers. We conduct numerical experiments in the domain of computer vision using the MNIST and CIFAR datasets, demonstrating significant improvement for the robust classification error of neural networks.

Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR)
Cite as:	arXiv:2201.09369 [cs.LG]
	(or arXiv:2201.09369v1 [cs.LG] for this version)

Submission history

From: Mark Beliaev [view email]
[v1] Sun, 23 Jan 2022 21:18:17 GMT (434kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2201.09369

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Efficient and Robust Classification for Sparse Attacks

Submission history