Scalable Backdoor Detection in Neural Networks

Harikumar, Haripriya; Le, Vuong; Rana, Santu; Bhattacharya, Sourangshu; Gupta, Sunil; Venkatesh, Svetha

Full-text links:

Download:

Current browse context:

cs.CV

< prev | next >

new | recent | 2006

Change to browse by:

Computer Science > Computer Vision and Pattern Recognition

Title: Scalable Backdoor Detection in Neural Networks

Authors: Haripriya Harikumar, Vuong Le, Santu Rana, Sourangshu Bhattacharya, Sunil Gupta, Svetha Venkatesh

(Submitted on 10 Jun 2020)

Abstract: Recently, it has been shown that deep learning models are vulnerable to Trojan attacks, where an attacker can install a backdoor during training time to make the resultant model misidentify samples contaminated with a small trigger patch. Current backdoor detection methods fail to achieve good detection performance and are computationally expensive. In this paper, we propose a novel trigger reverse-engineering based approach whose computational complexity does not scale with the number of labels, and is based on a measure that is both interpretable and universal across different network and patch types. In experiments, we observe that our method achieves a perfect score in separating Trojaned models from pure models, which is an improvement over the current state-of-the art method.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2006.05646 [cs.CV]
	(or arXiv:2006.05646v1 [cs.CV] for this version)

Submission history

From: Haripriya Harikumar [view email]
[v1] Wed, 10 Jun 2020 04:12:53 GMT (3514kb)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2006.05646

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Computer Vision and Pattern Recognition

Title: Scalable Backdoor Detection in Neural Networks

Submission history