An Extension of Fano's Inequality for Characterizing Model Susceptibility to Membership Inference Attacks

Jha, Sumit Kumar; Jha, Susmit; Ewetz, Rickard; Raj, Sunny; Velasquez, Alvaro; Pullum, Laura L.; Swami, Ananthram

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2009

Computer Science > Machine Learning

Title: An Extension of Fano's Inequality for Characterizing Model Susceptibility to Membership Inference Attacks

Authors: Sumit Kumar Jha, Susmit Jha, Rickard Ewetz, Sunny Raj, Alvaro Velasquez, Laura L. Pullum, Ananthram Swami

(Submitted on 17 Sep 2020)

Abstract: Deep neural networks have been shown to be vulnerable to membership inference attacks wherein the attacker aims to detect whether specific input data were used to train the model. These attacks can potentially leak private or proprietary data. We present a new extension of Fano's inequality and employ it to theoretically establish that the probability of success for a membership inference attack on a deep neural network can be bounded using the mutual information between its inputs and its activations. This enables the use of mutual information to measure the susceptibility of a DNN model to membership inference attacks. In our empirical evaluation, we show that the correlation between the mutual information and the susceptibility of the DNN model to membership inference attacks is 0.966, 0.996, and 0.955 for CIFAR-10, SVHN and GTSRB models, respectively.

Comments:	9 pages, 3 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
ACM classes:	I.2.0
Cite as:	arXiv:2009.08097 [cs.LG]
	(or arXiv:2009.08097v1 [cs.LG] for this version)

Submission history

From: Sumit Kumar Jha [view email]
[v1] Thu, 17 Sep 2020 06:37:15 GMT (344kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2009.08097

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: An Extension of Fano's Inequality for Characterizing Model Susceptibility to Membership Inference Attacks

Submission history