We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.AP

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Applications

Title: High-dimensional Feature Selection Using Hierarchical Bayesian Logistic Regression with Heavy-tailed Priors

Abstract: The problem of selecting the most useful features from a great many (eg, thousands) of candidates arises in many areas of modern sciences. An interesting problem from genomic research is that, from thousands of genes that are active (expressed) in certain tissue cells, we want to find the genes that can be used to separate tissues of different classes (eg. cancer and normal). In this paper, we report our empirical experiences of using Bayesian logistic regression based on heavy-tailed priors with moderately small degree freedom (such as 1) and very small scale, and using Hamiltonian Monte Carlo to do computation. We discuss the advantages and limitations of this method, and illustrate the difficulties that remain unsolved. The method is applied to a real microarray data set related to prostate cancer. The method identifies only 3 non-redundant genes out of 6033 candidates but achieves better leave-one-out cross-validated prediction accuracy than many other methods.
Comments: This is an earlier version of the paper arXiv:1405.3319. We do not want to cause confusion to readers
Subjects: Applications (stat.AP); Computation (stat.CO)
Cite as: arXiv:1308.4690 [stat.AP]
  (or arXiv:1308.4690v2 [stat.AP] for this version)

Submission history

From: Longhai Li [view email]
[v1] Wed, 21 Aug 2013 20:04:34 GMT (381kb)
[v2] Sat, 12 May 2018 03:12:27 GMT (0kb,I)

Link back to: arXiv, form interface, contact.