Statistics > Methodology
Title: Testing High Dimensional Differential Matrices, with Application to Detecting Schizophrenia Risk Genes
(Submitted on 1 Jun 2016 (v1), revised 26 Sep 2016 (this version, v2), latest version 7 Dec 2016 (v4))
Abstract: Scientists routinely compare gene expression levels in cases versus controls in part to determine genes associated with a disease. Similarly, detecting case-control differences in co-expression among genes can be critical to understanding complex human diseases; however, statistical methods have been limited by the high dimensional nature of this problem. In this paper, we construct a sparse-Leading-Eigenvalue-Driven (sLED) test for high-dimensional differential matrices, defined as the difference of gene-gene "relationship" matrices in two populations. sLED encompasses the traditional two-sample covariance test as a special case, but it can also be applied to more general scenarios such as comparing weighted adjacency matrices. By focusing on the spectrum of the differential matrix, sLED provides a novel perspective that accommodates the sparse and weak signals in many gene expression data sets and is closely related to Sparse Principal Component Analysis. When testing two-sample high dimensional covariance matrices, sLED achieves full power asymptotically under mild assumptions, and simulation studies verify that it outperforms other existing procedures for many biologically plausible scenarios. Applying sLED to the largest gene-expression dataset comparing Schizophrenia and control brains, we provide a novel list of risk genes and reveal intriguing patterns in gene co-expression change for Schizophrenia subjects.
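To make the idea of a differential matrix and a spectrum-based test statistic concrete, here is a minimal Python sketch. It is not the sLED procedure: it uses the ordinary (non-sparse) leading eigenvalue in absolute value rather than the sparse one the abstract describes, and it calibrates the statistic with a permutation scheme that is an assumption here, since the abstract does not say how the test is calibrated. The function names (differential_matrix, leading_eig_stat, permutation_pvalue) are hypothetical.

```python
import numpy as np


def differential_matrix(x, y):
    """Difference of sample covariance matrices, cov(y) - cov(x).

    x, y: arrays of shape (n_samples, n_genes), one row per subject.
    """
    return np.cov(y, rowvar=False) - np.cov(x, rowvar=False)


def leading_eig_stat(d):
    """Largest absolute eigenvalue of the symmetric matrix d.

    Simplification: no sparsity constraint is imposed, unlike sLED.
    """
    eigvals = np.linalg.eigvalsh(d)  # eigenvalues in ascending order
    return max(abs(eigvals[0]), abs(eigvals[-1]))


def permutation_pvalue(x, y, n_perm=200, seed=0):
    """Permutation p-value for the leading-eigenvalue statistic.

    Assumption: group labels are exchangeable under the null hypothesis.
    """
    rng = np.random.default_rng(seed)
    observed = leading_eig_stat(differential_matrix(x, y))
    pooled = np.vstack([x, y])
    n_x = x.shape[0]
    exceed = 0
    for _ in range(n_perm):
        idx = rng.permutation(pooled.shape[0])
        d_perm = differential_matrix(pooled[idx[:n_x]], pooled[idx[n_x:]])
        if leading_eig_stat(d_perm) >= observed:
            exceed += 1
    # Add-one correction keeps the p-value strictly positive.
    return observed, (exceed + 1) / (n_perm + 1)


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    controls = rng.standard_normal((100, 5))
    cases = 3.0 * rng.standard_normal((100, 5))  # inflated variance
    stat, pval = permutation_pvalue(controls, cases)
    print(f"statistic={stat:.3f}, p-value={pval:.4f}")
```

Replacing leading_eig_stat with a sparsity-constrained eigenvalue solver would move this sketch closer to the setting the abstract describes, where the signal is concentrated on a small subset of genes.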
Submission history
From: Lingxue Zhu
[v1] Wed, 1 Jun 2016 12:30:19 GMT (4445kb,D)
[v2] Mon, 26 Sep 2016 06:11:53 GMT (4445kb,D)
[v3] Tue, 22 Nov 2016 04:17:42 GMT (1803kb,D)
[v4] Wed, 7 Dec 2016 19:17:08 GMT (1803kb,D)