We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.CR

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Cryptography and Security

Title: Synthetic Data: Methods, Use Cases, and Risks

Abstract: Sharing data can often enable compelling applications and analytics. However, more often than not, valuable datasets contain information of a sensitive nature, and thus, sharing them can endanger the privacy of users and organizations. A possible alternative gaining momentum in both the research community and industry is to share synthetic data instead. The idea is to release artificially generated datasets that resemble the actual data -- more precisely, having similar statistical properties. In this article, we provide a gentle introduction to synthetic data and discuss its use cases, the privacy challenges that are still unaddressed, and its inherent limitations as an effective privacy-enhancing technology.
Comments: To Appear in IEEE Security and Privacy Magazine
Subjects: Cryptography and Security (cs.CR); Artificial Intelligence (cs.AI); Computers and Society (cs.CY)
Cite as: arXiv:2303.01230 [cs.CR]
  (or arXiv:2303.01230v3 [cs.CR] for this version)

Submission history

From: Emiliano De Cristofaro [view email]
[v1] Wed, 1 Mar 2023 16:35:33 GMT (706kb,D)
[v2] Mon, 6 Mar 2023 09:09:17 GMT (706kb,D)
[v3] Tue, 27 Feb 2024 07:03:15 GMT (706kb,D)

Link back to: arXiv, form interface, contact.