We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ML

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Machine Learning

Title: Hybrid safe-strong rules for efficient optimization in lasso-type problems

Abstract: The lasso model has been widely used for model selection in data mining, machine learning, and high-dimensional statistical analysis. However, with the ultrahigh-dimensional, large-scale data sets now collected in many real-world applications, it is important to develop algorithms to solve the lasso that efficiently scale up to problems of this size. Discarding features from certain steps of the algorithm is a powerful technique for increasing efficiency and addressing the Big Data challenge. In this paper, we propose a family of hybrid safe-strong rules (HSSR) which incorporate safe screening rules into the sequential strong rule (SSR) to remove unnecessary computational burden. In particular, we present two instances of HSSR, namely SSR-Dome and SSR-BEDPP, for the standard lasso problem. We further extend SSR-BEDPP to the elastic net and group lasso problems to demonstrate the generalizability of the hybrid screening idea. Extensive numerical experiments with synthetic and real data sets are conducted for both the standard lasso and the group lasso problems. Results show that our proposed hybrid rules can substantially outperform existing state-of-the-art rules.
Comments: 31 pages, 4 figures
Subjects: Machine Learning (stat.ML); Computation (stat.CO)
Cite as: arXiv:1704.08742 [stat.ML]
  (or arXiv:1704.08742v3 [stat.ML] for this version)

Submission history

From: Patrick Breheny [view email]
[v1] Thu, 27 Apr 2017 20:53:16 GMT (250kb,D)
[v2] Tue, 21 Nov 2017 19:41:25 GMT (45kb,D)
[v3] Mon, 1 Jun 2020 16:27:57 GMT (55kb,D)

Link back to: arXiv, form interface, contact.