We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.AP

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Applications

Title: Robust Mean Estimation in High Dimensions: An Outlier Fraction Agnostic and Efficient Algorithm

Abstract: The problem of robust mean estimation in high dimensions is studied, in which a certain fraction (less than half) of the datapoints can be arbitrarily corrupted. Motivated by compressive sensing, the robust mean estimation problem is formulated as the minimization of the $\ell_0$-`norm' of an \emph{outlier indicator vector}, under a second moment constraint on the datapoints. The $\ell_0$-`norm' is then relaxed to the $\ell_p$-norm ($0<p\leq 1$) in the objective, and it is shown that the global minima for each of these objectives are order-optimal and have optimal breakdown point for the robust mean estimation problem. Furthermore, a computationally tractable iterative $\ell_p$-minimization and hard thresholding algorithm is proposed that outputs an order-optimal robust estimate of the population mean. The proposed algorithm (with breakdown point $\approx 0.3$) does not require prior knowledge of the fraction of outliers, in contrast with most existing algorithms, and for $p=1$ it has near-linear time complexity. Both synthetic and real data experiments demonstrate that the proposed algorithm outperforms state-of-the-art robust mean estimation methods.
Comments: arXiv admin note: text overlap with arXiv:2008.09239
Subjects: Applications (stat.AP); Information Theory (cs.IT)
Cite as: arXiv:2102.08573 [stat.AP]
  (or arXiv:2102.08573v5 [stat.AP] for this version)

Submission history

From: Aditya Deshmukh [view email]
[v1] Wed, 17 Feb 2021 04:45:49 GMT (349kb,D)
[v2] Thu, 20 Jan 2022 18:44:55 GMT (340kb,D)
[v3] Mon, 24 Jan 2022 16:38:24 GMT (340kb,D)
[v4] Mon, 4 Apr 2022 14:18:50 GMT (329kb,D)
[v5] Wed, 7 Dec 2022 18:35:37 GMT (333kb,D)

Link back to: arXiv, form interface, contact.