Title: On testing the significance of sets of genes

Abstract: This paper discusses the problem of identifying differentially expressed groups of genes from a microarray experiment. The groups of genes are externally defined, for example, sets of gene pathways derived from biological databases. Our starting point is the interesting Gene Set Enrichment Analysis (GSEA) procedure of Subramanian et al. [Proc. Natl. Acad. Sci. USA 102 (2005) 15545-15550]. We study the problem in some generality and propose two potential improvements to GSEA: the maxmean statistic for summarizing gene-sets, and restandardization for more accurate inferences. We discuss a variety of examples and extensions, including the use of gene-set scores for class predictions. We also describe a new R language package GSA that implements our ideas.
Comments: Published in the Annals of Applied Statistics by the Institute of Mathematical Statistics
Subjects: Statistics Theory (math.ST); Molecular Networks (q-bio.MN); Applications (stat.AP)
Journal reference: Annals of Applied Statistics 2007, Vol. 1, No. 1, 107-129
DOI: 10.1214/07-AOAS101
Report number: IMS-AOAS-AOAS101
Cite as: arXiv:math/0610667 [math.ST]
  (or arXiv:math/0610667v2 [math.ST] for this version)

Submission history

From: Rob Tibshirani
[v1] Sun, 22 Oct 2006 23:44:00 GMT (57kb)
[v2] Tue, 4 Sep 2007 05:25:57 GMT (312kb,S)
