Current browse context:
cs.CR
Change to browse by:
References & Citations
Computer Science > Cryptography and Security
Title: Synthetic Data: Methods, Use Cases, and Risks
(Submitted on 1 Mar 2023 (v1), last revised 27 Feb 2024 (this version, v3))
Abstract: Sharing data can often enable compelling applications and analytics. However, more often than not, valuable datasets contain information of a sensitive nature, and thus, sharing them can endanger the privacy of users and organizations. A possible alternative gaining momentum in both the research community and industry is to share synthetic data instead. The idea is to release artificially generated datasets that resemble the actual data -- more precisely, having similar statistical properties. In this article, we provide a gentle introduction to synthetic data and discuss its use cases, the privacy challenges that are still unaddressed, and its inherent limitations as an effective privacy-enhancing technology.
Submission history
From: Emiliano De Cristofaro [view email][v1] Wed, 1 Mar 2023 16:35:33 GMT (706kb,D)
[v2] Mon, 6 Mar 2023 09:09:17 GMT (706kb,D)
[v3] Tue, 27 Feb 2024 07:03:15 GMT (706kb,D)
Link back to: arXiv, form interface, contact.