We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Methodology

Title: Efficient Estimation of the Maximal Association between Multiple Predictors and a Survival Outcome

Abstract: This paper develops a new approach to post-selection inference for screening high-dimensional predictors of survival outcomes. Post-selection inference for right-censored outcome data has been investigated in the literature, but much remains to be done to make the methods both reliable and computationally-scalable in high-dimensions. Machine learning tools are commonly used to provide {\it predictions} of survival outcomes, but the estimated effect of a selected predictor suffers from confirmation bias unless the selection is taken into account. The new approach involves construction of semi-parametrically efficient estimators of the linear association between the predictors and the survival outcome, which are used to build a test statistic for detecting the presence of an association between any of the predictors and the outcome. Further, a stabilization technique reminiscent of bagging allows a normal calibration for the resulting test statistic, which enables the construction of confidence intervals for the maximal association between predictors and the outcome and also greatly reduces computational cost. Theoretical results show that this testing procedure is valid even when the number of predictors grows superpolynomially with sample size, and our simulations support that this asymptotic guarantee is indicative the performance of the test at moderate sample sizes. The new approach is applied to the problem of identifying patterns in viral gene expression associated with the potency of an antiviral drug.
Comments: 102 pages, 7 figures, 4 tables
Subjects: Methodology (stat.ME); Statistics Theory (math.ST); Other Statistics (stat.OT)
MSC classes: 62N03, 62G10, 62G20
Cite as: arXiv:2112.10996 [stat.ME]
  (or arXiv:2112.10996v1 [stat.ME] for this version)

Submission history

From: Tzu-Jung Huang [view email]
[v1] Tue, 21 Dec 2021 05:47:58 GMT (122kb,D)

Link back to: arXiv, form interface, contact.