Title: High dimensional thresholded regression and shrinkage effect

Abstract: High-dimensional sparse modeling via regularization provides a powerful tool for analyzing large-scale data sets and obtaining meaningful, interpretable models. Nonconvex penalty functions have advantages for selecting important features in high dimensions, but the global optimality of such methods is still not well understood. In this paper, we consider sparse regression with the hard-thresholding penalty, which we show gives rise to thresholded regression. This approach is motivated by its computational advantage and by its close connection with $L_0$-regularization, which has appealing sampling properties but can be unrealistic to implement in practice. Under some mild regularity conditions allowing possibly exponentially growing dimensionality, we establish the oracle inequalities of the resulting regularized estimator, as the global minimizer, under various prediction and variable selection losses, as well as the oracle risk inequalities of the hard-thresholded estimator followed by a further $L_2$-regularization. The risk properties exhibit interesting shrinkage effects under both estimation and prediction losses. We identify the optimal choice of the ridge parameter, which is shown to have simultaneous advantages for both the $L_2$-loss and the prediction loss. These new results and phenomena are supported by simulation and real data examples.
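The two-stage idea mentioned in the abstract (a hard-thresholded estimator followed by a further $L_2$-regularization) can be sketched roughly as follows. This is an illustration only and not the authors' algorithm: it assumes a low-dimensional design so that an ordinary least-squares fit is available to threshold, and the function names `hard_threshold` and `ridge_refit`, the simulated data, and the values of `lam` and `ridge` are all hypothetical choices made for the example.

```python
import numpy as np

def hard_threshold(z, lam):
    """Keep entries with |z_j| > lam and set the rest to zero."""
    z = np.asarray(z, dtype=float)
    return np.where(np.abs(z) > lam, z, 0.0)

def ridge_refit(X, y, support, ridge):
    """Ridge regression restricted to the columns listed in `support`."""
    beta = np.zeros(X.shape[1])
    if support.size == 0:
        return beta
    Xs = X[:, support]
    gram = Xs.T @ Xs + ridge * np.eye(support.size)
    beta[support] = np.linalg.solve(gram, Xs.T @ y)
    return beta

# Simulated sparse linear model (assumed setup, for illustration only).
rng = np.random.default_rng(0)
n, p = 200, 10
X = rng.standard_normal((n, p))
beta_true = np.zeros(p)
beta_true[:3] = [3.0, -2.0, 1.5]
y = X @ beta_true + 0.5 * rng.standard_normal(n)

# Stage 1: threshold an initial least-squares fit to select a support.
beta_ols, *_ = np.linalg.lstsq(X, y, rcond=None)
beta_ht = hard_threshold(beta_ols, lam=0.5)
support = np.flatnonzero(beta_ht)

# Stage 2: refit on the selected support with a ridge penalty.
beta_ridge = ridge_refit(X, y, support, ridge=1.0)
beta_unshrunk = ridge_refit(X, y, support, ridge=0.0)
```

Comparing `beta_ridge` with `beta_unshrunk` shows the shrinkage introduced by the ridge step on the selected coefficients; how to choose the ridge parameter well is the subject of the paper and is not addressed by this sketch.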
Comments: 23 pages, 3 figures, 5 tables
Subjects: Methodology (stat.ME); Machine Learning (stat.ML)
MSC classes: 62J07 (Primary) 62F07, 62P10 (Secondary)
Journal reference: Journal of the Royal Statistical Society Series B 76, 627-649
DOI: 10.1111/rssb.12037
Cite as: arXiv:1605.03306 [stat.ME]
  (or arXiv:1605.03306v1 [stat.ME] for this version)

Submission history

From: Jinchi Lv
[v1] Wed, 11 May 2016 07:12:07 GMT (63kb)
