Title: Reliable Covariance Estimation

Abstract: Covariance or scatter matrix estimation is ubiquitous in modern statistical and machine learning applications. The task is especially challenging because most real-world datasets are essentially non-Gaussian. The data are often contaminated by outliers and/or have a heavy-tailed distribution, which causes the sample covariance to behave very poorly and calls for robust estimation methodology. The natural framework for robust scatter matrix estimation is based on elliptical populations. Here, Tyler's estimator stands out by being distribution-free within the elliptical family and easy to compute. Existing works thoroughly study the performance of Tyler's estimator assuming ellipticity, but without providing any tools to verify this assumption when the covariance is unknown in advance. We address the following open question: given the sampled data and no prior on the data-generating process, how can one assess the quality of the scatter matrix estimator? In this work we show that this question can be reformulated as an asymptotic uniformity test for certain sequences of exchangeable vectors on the unit sphere. We develop a consistent and easily applicable goodness-of-fit test against all alternatives to ellipticity when the scatter matrix is unknown. The findings are supported by numerical simulations demonstrating the power of the suggested technique.
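
For orientation, here is a minimal Python sketch of the two objects the abstract refers to: Tyler's estimator and the vectors on the unit sphere obtained after whitening with it. It is not taken from the paper. It assumes the commonly used fixed-point form of Tyler's estimator with the trace normalized to the dimension, assumes the data are already centered, and uses hypothetical names (`tyler_scatter`, `whitened_directions`) and a synthetic multivariate-t sample chosen only for illustration. The goodness-of-fit test itself is not reproduced; the sketch only produces the directions whose uniformity such a test would presumably examine.

```python
import numpy as np


def tyler_scatter(X, n_iter=200, tol=1e-8):
    """Fixed-point iteration for Tyler's scatter estimator.

    Assumption: rows of X are centered samples; the estimate is scaled
    so that its trace equals the dimension p.
    """
    n, p = X.shape
    sigma = np.eye(p)
    for _ in range(n_iter):
        # Quadratic forms x_i^T sigma^{-1} x_i for every sample.
        q = np.einsum("ij,ij->i", X @ np.linalg.inv(sigma), X)
        # Weighted sample covariance with weights 1 / q_i.
        new = (p / n) * (X / q[:, None]).T @ X
        new *= p / np.trace(new)  # fix the arbitrary scale
        converged = np.linalg.norm(new - sigma, ord="fro") < tol
        sigma = new
        if converged:
            break
    return sigma


def whitened_directions(X, sigma):
    """Whiten samples with the estimated scatter and project to the unit sphere."""
    L = np.linalg.cholesky(sigma)
    Z = np.linalg.solve(L, X.T).T  # z_i = L^{-1} x_i
    return Z / np.linalg.norm(Z, axis=1, keepdims=True)


# Illustrative heavy-tailed elliptical sample: multivariate t with 3 degrees
# of freedom (all values below are assumptions for the demo).
rng = np.random.default_rng(0)
n, p, df = 2000, 3, 3
A = np.array([[2.0, 0.0, 0.0],
              [0.5, 1.0, 0.0],
              [0.3, 0.2, 0.5]])
true_shape = A @ A.T
true_shape *= p / np.trace(true_shape)  # same trace normalization as the estimate

G = rng.standard_normal((n, p))
chi = rng.chisquare(df, size=n)
X = (G @ A.T) / np.sqrt(chi / df)[:, None]

sigma_hat = tyler_scatter(X)
U = whitened_directions(X, sigma_hat)
```

Under ellipticity the rows of `U` should look approximately uniform on the sphere, while a clear departure from uniformity would suggest that the elliptical model, and hence the estimate, is unreliable. Because every row is whitened with the same data-dependent `sigma_hat`, the rows are exchangeable rather than independent, which is consistent with the abstract's framing of the problem as an asymptotic uniformity test.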
Subjects: Statistics Theory (math.ST); Machine Learning (cs.LG)
Cite as: arXiv:2006.03311 [math.ST]
  (or arXiv:2006.03311v3 [math.ST] for this version)

Submission history

From: Ilya Soloveychik
[v1] Fri, 5 Jun 2020 08:51:16 GMT (16kb)
[v2] Sun, 14 Jun 2020 12:47:03 GMT (16kb)
[v3] Fri, 3 Jul 2020 16:06:25 GMT (121kb,D)
[v4] Fri, 14 Apr 2023 13:57:17 GMT (185kb,D)