We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Methodology

Title: A note on marginal correlation based screening

Abstract: Independence screening methods such as the two sample $t$-test and the marginal correlation based ranking are among the most widely used techniques for variable selection in ultrahigh dimensional data sets. In this short note, simple examples are used to demonstrate potential problems with the independence screening methods in the presence of correlated predictors. Also, an example is considered where all important variables are independent among themselves and all but one important variables are independent with the unimportant variables. Furthermore, a real data example from a genome wide association study is used to illustrate inferior performance of marginal correlation screening compared to another screening method.
Subjects: Methodology (stat.ME)
MSC classes: 62J05, 68Q32
Cite as: arXiv:1707.08143 [stat.ME]
  (or arXiv:1707.08143v3 [stat.ME] for this version)

Submission history

From: Vivekananda Roy [view email]
[v1] Tue, 25 Jul 2017 18:23:31 GMT (7kb)
[v2] Thu, 5 Nov 2020 02:16:44 GMT (69kb,D)
[v3] Mon, 16 Nov 2020 16:52:55 GMT (70kb,D)

Link back to: arXiv, form interface, contact.