Title: Multivariate Location and Scatter Matrix Estimation Under Cellwise and Casewise Contamination

Abstract: We consider the problem of multivariate location and scatter matrix estimation when the data contain cellwise and casewise outliers. Agostinelli et al. (2015) propose a two-step approach to deal with this problem: first, apply a univariate filter to remove cellwise outliers and second, apply a generalized S-estimator to downweight casewise outliers. We improve this proposal in three main directions. First, we introduce a consistent bivariate filter to be used in combination with the univariate filter in the first step. Second, we propose a new fast subsampling procedure to generate starting points for the generalized S-estimator in the second step. Third, we consider a non-monotonic weight function for the generalized S-estimator to better deal with casewise outliers in high dimensions. A simulation study and a real data example show that, unlike the original two-step procedure, the modified two-step approach performs and scales well in high dimensions. Moreover, the modified procedure outperforms the original one and other state-of-the-art robust procedures under cellwise and casewise data contamination.
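
To make the idea of a first-step cellwise filter concrete, here is a minimal Python sketch. It is not the filter of Agostinelli et al. (2015), nor the bivariate filter proposed in this paper: as a stand-in, it uses a simple fixed cutoff on robust z-scores (median and MAD) and marks flagged cells as missing. The function names `flag_cellwise_outliers` and `filter_cells` and the default cutoff of 3.0 are assumptions chosen for illustration.

```python
import statistics


def flag_cellwise_outliers(column, cutoff=3.0):
    """Return a copy of `column` with cells whose robust z-score
    exceeds `cutoff` replaced by None (treated as missing).

    Assumption: a fixed-cutoff rule, used here only as an illustration.
    """
    med = statistics.median(column)
    # MAD scaled by 1.4826 so it estimates the standard deviation
    # when the clean data are normally distributed.
    mad = 1.4826 * statistics.median(abs(x - med) for x in column)
    if mad == 0:
        # Degenerate scale: nothing can be standardized, so flag nothing.
        return list(column)
    return [x if abs(x - med) / mad <= cutoff else None for x in column]


def filter_cells(rows, cutoff=3.0):
    """Apply the univariate rule to every variable (column) separately."""
    columns = list(zip(*rows))
    filtered = [flag_cellwise_outliers(col, cutoff) for col in columns]
    # Transpose back so each inner list is again one case (row).
    return [list(row) for row in zip(*filtered)]


rows = [
    [1.0, 10.0],
    [1.1, 10.2],
    [0.9, 9.9],
    [1.2, 50.0],  # second cell is a cellwise outlier
    [1.0, 10.1],
]
filtered_rows = filter_cells(rows)
```

Only the contaminated cell is removed, and the rest of that case is kept. This is the point of filtering cells rather than downweighting whole cases. A second step would then need an estimator that tolerates the resulting missing values, which is the role the generalized S-estimator plays in the two-step approach described above.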
Subjects: Statistics Theory (math.ST)
MSC classes: 62G35, 62G05, 62G20
Cite as: arXiv:1609.00402 [math.ST]
  (or arXiv:1609.00402v2 [math.ST] for this version)

Submission history

From: Andy Leung
[v1] Thu, 1 Sep 2016 21:03:49 GMT (104kb,D)
[v2] Sun, 25 Dec 2016 10:36:00 GMT (108kb,D)