We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.DB

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Computer Science > Databases

Title: Optimizing error of high-dimensional statistical queries under differential privacy

Abstract: Differentially private algorithms for answering sets of predicate counting queries on a sensitive database have many applications. Organizations that collect individual-level data, such as statistical agencies and medical institutions, use them to safely release summary tabulations. However, existing techniques are accurate only on a narrow class of query workloads, or are extremely slow, especially when analyzing more than one or two dimensions of the data. In this work we propose HDMM, a new differentially private algorithm for answering a workload of predicate counting queries, that is especially effective for higher-dimensional datasets. HDMM represents query workloads using an implicit matrix representation and exploits this compact representation to efficiently search (a subset of) the space of differentially private algorithms for one that answers the input query workload with high accuracy. We empirically show that HDMM can efficiently answer queries with lower error than state-of-the-art techniques on a variety of low and high dimensional datasets.
Subjects: Databases (cs.DB); Cryptography and Security (cs.CR)
Journal reference: PVLDB, 11 (10): 1206-1219, 2018
DOI: 10.14778/3231751.3231769
Cite as: arXiv:1808.03537 [cs.DB]
  (or arXiv:1808.03537v1 [cs.DB] for this version)

Submission history

From: Ryan McKenna [view email]
[v1] Fri, 10 Aug 2018 13:44:26 GMT (1422kb,D)

Link back to: arXiv, form interface, contact.