Differentially Private Representation for NLP: Formal Guarantee and An Empirical Study on Privacy and Fairness

Lyu, Lingjuan; He, Xuanli; Li, Yitong

Full-text links:

Download:

Current browse context:

cs.LG

< prev | next >

new | recent | 2010

Computer Science > Machine Learning

Title: Differentially Private Representation for NLP: Formal Guarantee and An Empirical Study on Privacy and Fairness

Authors: Lingjuan Lyu, Xuanli He, Yitong Li

(Submitted on 3 Oct 2020)

Abstract: It has been demonstrated that hidden representation learned by a deep model can encode private information of the input, hence can be exploited to recover such information with reasonable accuracy. To address this issue, we propose a novel approach called Differentially Private Neural Representation (DPNR) to preserve the privacy of the extracted representation from text. DPNR utilises Differential Privacy (DP) to provide a formal privacy guarantee. Further, we show that masking words via dropout can further enhance privacy. To maintain utility of the learned representation, we integrate DP-noisy representation into a robust training process to derive a robust target model, which also helps for model fairness over various demographic variables. Experimental results on benchmark datasets under various parameter settings demonstrate that DPNR largely reduces privacy leakage without significantly sacrificing the main task performance.

Comments:	accepted to Findings of EMNLP 2020
Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
Cite as:	arXiv:2010.01285 [cs.LG]
	(or arXiv:2010.01285v1 [cs.LG] for this version)

Submission history

From: Xuanli He [view email]
[v1] Sat, 3 Oct 2020 05:58:32 GMT (656kb,D)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

Link back to: arXiv, form interface, contact.

> cs > arXiv:2010.01285

Download:

Current browse context:

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

Computer Science > Machine Learning

Title: Differentially Private Representation for NLP: Formal Guarantee and An Empirical Study on Privacy and Fairness

Submission history