We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Methodology

Title: Sparse regression and marginal testing using cluster prototypes

Abstract: We propose a new approach for sparse regression and marginal testing, for data with correlated features. Our procedure first clusters the features, and then chooses as the cluster prototype the most informative feature in that cluster. Then we apply either sparse regression (lasso) or marginal significance testing to these prototypes. While this kind of strategy is not entirely new, a key feature of our proposal is its use of the post-selection inference theory of Taylor et al. (2014) and Lee et al. (2014) to compute exact p-values and confidence intervals that properly account for the selection of prototypes.
We also apply the recent "knockoff" idea of Barber and Cand\`es to provide exact finite sample control of the FDR of our regression procedure. We illustrate our proposals on both real and simulated data.
Comments: 43 pages, 19 figures
Subjects: Methodology (stat.ME)
Cite as: arXiv:1503.00334 [stat.ME]
  (or arXiv:1503.00334v2 [stat.ME] for this version)

Submission history

From: Stephen Reid [view email]
[v1] Sun, 1 Mar 2015 19:11:03 GMT (573kb,D)
[v2] Fri, 13 Mar 2015 19:12:31 GMT (574kb,D)

Link back to: arXiv, form interface, contact.