Current browse context:
cs.CV
Change to browse by:
References & Citations
Computer Science > Machine Learning
Title: On the Privacy Effect of Data Enhancement via the Lens of Memorization
(Submitted on 17 Aug 2022 (v1), last revised 23 Mar 2024 (this version, v4))
Abstract: Machine learning poses severe privacy concerns as it has been shown that the learned models can reveal sensitive information about their training data. Many works have investigated the effect of widely adopted data augmentation and adversarial training techniques, termed data enhancement in the paper, on the privacy leakage of machine learning models. Such privacy effects are often measured by membership inference attacks (MIAs), which aim to identify whether a particular example belongs to the training set or not. We propose to investigate privacy from a new perspective called memorization. Through the lens of memorization, we find that previously deployed MIAs produce misleading results as they are less likely to identify samples with higher privacy risks as members compared to samples with low privacy risks. To solve this problem, we deploy a recent attack that can capture individual samples' memorization degrees for evaluation. Through extensive experiments, we unveil several findings about the connections between three essential properties of machine learning models, including privacy, generalization gap, and adversarial robustness. We demonstrate that the generalization gap and privacy leakage are less correlated than those of the previous results. Moreover, there is not necessarily a trade-off between adversarial robustness and privacy as stronger adversarial robustness does not make the model more susceptible to privacy attacks.
Submission history
From: Xiao Li [view email][v1] Wed, 17 Aug 2022 13:02:17 GMT (312kb,D)
[v2] Tue, 28 Feb 2023 06:17:36 GMT (1173kb,D)
[v3] Wed, 20 Mar 2024 14:13:44 GMT (423kb,D)
[v4] Sat, 23 Mar 2024 03:33:32 GMT (423kb,D)
Link back to: arXiv, form interface, contact.