We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.CO

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo ScienceWISE logo

Statistics > Computation

Title: Computation of projection regression depth and its induced median

Authors: Yijun Zuo
Abstract: Notions of depth in regression have been introduced and studied in the literature. The most famous example is Regression Depth (RD), which is a direct extension of location depth to regression. The projection regression depth (PRD) is the extension of another prevailing location depth, the projection depth, to regression. The computation issues of the RD have been discussed in the literature. The computation issues of the PRD have never been dealt with before. The computation issues of the PRD and its induced median (maximum depth estimator) in a regression setting are addressed now. For a given $\bs{\beta}\in\R^p$ exact algorithms for the PRD with cost $O(n^2\log n)$ ($p=2$) and $O(N(n, p)(p^{3}+n\log n+np^{1.5}+npN_{Iter}))$ ($p>2$) and approximate algorithms for the PRD and its induced median with cost respectively $O(N_{\mb{v}}np)$ and $O(Rp N_{\bs{\beta}}(p^2+nN_{\mb{v}}N_{Iter}))$ are proposed. Here $N(n, p)$ is a number defined based on the total number of $(p-1)$ dimensional hyperplanes formed by points induced from sample points and the $\bs{\beta}$; $N_{\mb{v}}$ is the total number of unit directions $\mb{v}$ utilized; $N_{\bs{\beta}}$ is the total number of candidate regression parameters $\bs{\beta}$ employed; $N_{Iter}$ is the total number of iterations carried out in an optimization algorithm; $R$ is the total number of replications. Furthermore, as the second major contribution, three PRD induced estimators, which can be computed up to 30 times faster than that of the PRD induced median while maintaining a similar level of accuracy are introduced. Examples and simulation studies reveal that the depth median induced from the PRD is favorable in terms of robustness and efficiency, compared to the maximum depth estimator induced from the RD, which is the current leading regression median.
Comments: 33 pages and 6 figures and 9 tables
Subjects: Computation (stat.CO)
MSC classes: 62G08 (Primary), 62J05, 62J99 (Secondary)
Cite as: arXiv:1905.11846 [stat.CO]
  (or arXiv:1905.11846v5 [stat.CO] for this version)

Submission history

From: Yijun Zuo [view email]
[v1] Tue, 28 May 2019 14:27:59 GMT (67kb)
[v2] Mon, 11 Nov 2019 22:28:56 GMT (109kb)
[v3] Fri, 27 Mar 2020 18:03:07 GMT (114kb)
[v4] Sat, 12 Sep 2020 03:20:45 GMT (115kb)
[v5] Mon, 18 Jan 2021 15:03:26 GMT (122kb)

Link back to: arXiv, form interface, contact.