We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Methodology

Title: Gaining Outlier Resistance with Progressive Quantiles: Fast Algorithms and Theoretical Studies

Abstract: Outliers widely occur in big-data applications and may severely affect statistical estimation and inference. In this paper, a framework of outlier-resistant estimation is introduced to robustify an arbitrarily given loss function. It has a close connection to the method of trimming and includes explicit outlyingness parameters for all samples, which in turn facilitates computation, theory, and parameter tuning. To tackle the issues of nonconvexity and nonsmoothness, we develop scalable algorithms with implementation ease and guaranteed fast convergence. In particular, a new technique is proposed to alleviate the requirement on the starting point such that on regular datasets, the number of data resamplings can be substantially reduced. Based on combined statistical and computational treatments, we are able to perform nonasymptotic analysis beyond M-estimation. The obtained resistant estimators, though not necessarily globally or even locally optimal, enjoy minimax rate optimality in both low dimensions and high dimensions. Experiments in regression, classification, and neural networks show excellent performance of the proposed methodology at the occurrence of gross outliers.
Subjects: Methodology (stat.ME); Statistics Theory (math.ST); Machine Learning (stat.ML)
Cite as: arXiv:2112.08471 [stat.ME]
  (or arXiv:2112.08471v3 [stat.ME] for this version)

Submission history

From: Yiyuan She [view email]
[v1] Wed, 15 Dec 2021 20:35:21 GMT (629kb,D)
[v2] Mon, 7 Nov 2022 20:59:49 GMT (267kb,D)
[v3] Tue, 18 Apr 2023 18:10:24 GMT (267kb,D)

Link back to: arXiv, form interface, contact.