We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Machine Learning

Title: High-Dimensional Differentially-Private EM Algorithm: Methods and Near-Optimal Statistical Guarantees

Abstract: In this paper, we develop a general framework to design differentially private expectation-maximization (EM) algorithms in high-dimensional latent variable models, based on the noisy iterative hard-thresholding. We derive the statistical guarantees of the proposed framework and apply it to three specific models: Gaussian mixture, mixture of regression, and regression with missing covariates. In each model, we establish the near-optimal rate of convergence with differential privacy constraints, and show the proposed algorithm is minimax rate optimal up to logarithm factors. The technical tools developed for the high-dimensional setting are then extended to the classic low-dimensional latent variable models, and we propose a near rate-optimal EM algorithm with differential privacy guarantees in this setting. Simulation studies and real data analysis are conducted to support our results.
Comments: 68 pages, 3 figures
Subjects: Machine Learning (stat.ML); Cryptography and Security (cs.CR); Machine Learning (cs.LG); Statistics Theory (math.ST); Methodology (stat.ME)
Cite as: arXiv:2104.00245 [stat.ML]
  (or arXiv:2104.00245v2 [stat.ML] for this version)

Submission history

From: Zhe Zhang [view email]
[v1] Thu, 1 Apr 2021 04:08:34 GMT (490kb,D)
[v2] Thu, 9 Sep 2021 03:27:39 GMT (466kb,D)

Link back to: arXiv, form interface, contact.