We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Methodology

Title: Weighted SAMGSR: combining significance analysis of microarray-gene set reduction algorithm with pathway topology-based weights to select relevant genes

Abstract: Introduction
It has been demonstrated that a pathway-based feature selection method which incorporates biological information within pathways into the process of feature selection usually outperform a gene-based feature selection algorithm in terms of predictive accuracy, stability, and biological interpretation. Significance analysis of microarray-gene set reduction algorithm (SAMGSR), an extension to a gene set analysis method with further reduction of the selected pathways to their respective core subsets, can be regarded as a pathway-based feature selection method.
Results and Discussion
In SAMGSR, whether a gene is selected is mainly determined by its expression difference between the phenotypes, and partially by the number of pathways to which this gene belongs, but ignoring the topology information among pathways. In this study, we propose a weighted version of the SAMGSR algorithm by constructing weights based on the connectivity among genes and then incorporating these weights in the test statistic.
Conclusions
Using both simulated and real-world data, we evaluate the performance of the proposed SAMGSR extension and demonstrate that gene connectivity is indeed informative for feature selection.
Subjects: Methodology (stat.ME)
Cite as: arXiv:1605.03697 [stat.ME]
  (or arXiv:1605.03697v1 [stat.ME] for this version)

Submission history

From: Suyan Tian [view email]
[v1] Thu, 12 May 2016 07:11:51 GMT (1181kb)

Link back to: arXiv, form interface, contact.