We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Machine Learning

Title: Solar: a least-angle regression for stable variable selection in high-dimensional spaces

Abstract: We propose a new algorithm for variable selection in high-dimensional data, called subsample-ordered least-angle regression (solar). Solar relies on the average $L_0$ solution path computed across subsamples and alleviates several known high-dimensional issues with lasso and least-angle regression. We illustrate in simulations that, with the same computation load, solar yields substantial improvements over lasso in terms of the sparsity (37-64\% reduction in the average number of selected variables), stability and accuracy of variable selection. Moreover, solar supplemented with the hold-out average (an adaptation of classical post-OLS tests) successfully purges almost all of the redundant variables while retaining all of the informative variables. Using simulations and real-world data, we also illustrate numerically that sparse solar variable selection is robust to complicated dependence structures and harsh settings of the irrepresentable condition. Moreover, replacing lasso with solar in an ensemble system (e.g., the bootstrap ensemble), significantly reduces the computation load (at least 96\% fewer subsample repetitions) of the bootstrap ensemble and improves selection sparsity. We provide a Python parallel computing package for solar (solarpy) in the supplementary file and this https URL
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as: arXiv:2007.15707 [stat.ML]
  (or arXiv:2007.15707v2 [stat.ML] for this version)

Submission history

From: Ning Xu [view email]
[v1] Thu, 30 Jul 2020 19:45:59 GMT (2533kb,D)
[v2] Mon, 26 Apr 2021 09:49:08 GMT (1371kb,D)
[v3] Fri, 6 May 2022 02:08:33 GMT (2273kb,D)

Link back to: arXiv, form interface, contact.