We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Methodology

Title: Simultaneous Detection of Signal Regions Using Quadratic Scan Statistics With Applications in Whole Genome Association Studies

Abstract: We consider in this paper detection of signal regions associated with disease outcomes in whole genome association studies. Gene- or region-based methods have become increasingly popular in whole genome association analysis as a complementary approach to traditional individual variant analysis. However, these methods test for the association between an outcome and the genetic variants in a pre-specified region, e.g., a gene. In view of massive intergenic regions in whole genome sequencing (WGS) studies, we propose a computationally efficient quadratic scan (Q-SCAN) statistic based method to detect the existence and the locations of signal regions by scanning the genome continuously. The proposed method accounts for the correlation (linkage disequilibrium) among genetic variants, and allows for signal regions to have both causal and neutral variants, and the effects of signal variants to be in different directions. We study the asymptotic properties of the proposed Q-SCAN statistics. We derive an empirical threshold that controls for the family-wise error rate, and show that under regularity conditions the proposed method consistently selects the true signal regions. We perform simulation studies to evaluate the finite sample performance of the proposed method. Our simulation results show that the proposed procedure outperforms the existing methods, especially when signal regions have causal variants whose effects are in different directions, or are contaminated with neutral variants. We illustrate Q-SCAN by analyzing the WGS data from the Atherosclerosis Risk in Communities (ARIC) study.
Subjects: Methodology (stat.ME)
Journal reference: Journal of American Statistical Association (2020)
DOI: 10.1080/01621459.2020.1822849
Cite as: arXiv:1710.05021 [stat.ME]
  (or arXiv:1710.05021v4 [stat.ME] for this version)

Submission history

From: Zilin Li [view email]
[v1] Fri, 13 Oct 2017 17:51:46 GMT (705kb,D)
[v2] Mon, 16 Oct 2017 16:55:11 GMT (1839kb,AD)
[v3] Thu, 12 Jul 2018 02:45:53 GMT (1381kb,AD)
[v4] Thu, 25 Jul 2019 06:28:29 GMT (450kb,D)

Link back to: arXiv, form interface, contact.