Protecting Classifiers From Attacks. A Bayesian Approach

Gallego, Victor; Naveiro, Roi; Redondo, Alberto; Insua, David Rios; Ruggeri, Fabrizio

Full-text links:

Download:

Current browse context:

stat

< prev | next >

new | recent | 2004

Statistics > Machine Learning

Title: Protecting Classifiers From Attacks. A Bayesian Approach

Authors: Victor Gallego, Roi Naveiro, Alberto Redondo, David Rios Insua, Fabrizio Ruggeri

(Submitted on 18 Apr 2020)

Abstract: Classification problems in security settings are usually modeled as confrontations in which an adversary tries to fool a classifier manipulating the covariates of instances to obtain a benefit. Most approaches to such problems have focused on game-theoretic ideas with strong underlying common knowledge assumptions, which are not realistic in the security realm. We provide an alternative Bayesian framework that accounts for the lack of precise knowledge about the attacker's behavior using adversarial risk analysis. A key ingredient required by our framework is the ability to sample from the distribution of originating instances given the possibly attacked observed one. We propose a sampling procedure based on approximate Bayesian computation, in which we simulate the attacker's problem taking into account our uncertainty about his elements. For large scale problems, we propose an alternative, scalable approach that could be used when dealing with differentiable classifiers. Within it, we move the computational load to the training phase, simulating attacks from an adversary, adapting the framework to obtain a classifier robustified against attacks.

Subjects:	Machine Learning (stat.ML); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Computation (stat.CO)
Cite as:	arXiv:2004.08705 [stat.ML]
	(or arXiv:2004.08705v1 [stat.ML] for this version)

Submission history

From: Victor Gallego [view email]
[v1] Sat, 18 Apr 2020 21:21:56 GMT (798kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> stat > arXiv:2004.08705

Download:

Current browse context:

Change to browse by:

References & Citations

Bookmark

Statistics > Machine Learning

Title: Protecting Classifiers From Attacks. A Bayesian Approach

Submission history