Title: A Bayesian Semiparametric Approach to Learning About Gene-Gene Interactions in Case-Control Studies

Abstract: Gene-gene interactions are often regarded as playing significant roles in influencing the variability of complex traits. Although much research has been devoted to this area, to date a comprehensive statistical model that adequately addresses the highly dependent structures associated with the interactions between the genes, the multiple loci of every gene, and the varying and unknown number of sub-populations that the subjects arise from seems to be lacking. In this paper, we propose and develop a novel Bayesian semiparametric approach composed of finite mixtures based on Dirichlet processes and a hierarchical matrix-normal distribution that can comprehensively account for the unknown number of sub-populations and gene-gene interactions. Then, by formulating novel and suitable Bayesian tests of hypotheses, using a suitable metric for comparing clusterings in conjunction with the interaction parameters of the matrix-normal, we attempt to single out the roles of the genes, individually and in interaction with other genes, in case-control studies. Importantly, we also attempt to identify the disease-producing loci. Our model facilitates a highly efficient parallel computing methodology, combining Gibbs sampling and Transformation-based MCMC (TMCMC). Application of our model and methodologies to biologically realistic data sets, simulated from an existing, realistic population genetics model associated with case-control studies, revealed quite encouraging performance. We also applied our ideas to a real myocardial infarction dataset and obtained interesting results that partly agree with, and also complement, the existing works in this area, revealing the importance of sophisticated and realistic modeling of gene-gene interactions.
Comments: Updated version including a detailed Bayesian analysis of a real case-control genotype dataset on myocardial infarction (heart attack). Our results seem to be very interesting and insightful, and in particular suggest that there is perhaps no alternative to sophisticated, nonparametric Bayesian modeling that explicitly accounts for gene-gene and SNP-SNP interactions.
Subjects: Applications (stat.AP)
Cite as: arXiv:1411.7571 [stat.AP]
  (or arXiv:1411.7571v2 [stat.AP] for this version)

Submission history

From: Durba Bhattacharya ms
[v1] Thu, 27 Nov 2014 12:27:58 GMT (1198kb,D)
[v2] Thu, 19 May 2016 12:15:00 GMT (338kb,D)
[v3] Tue, 15 Nov 2016 17:22:12 GMT (1555kb,D)
[v4] Tue, 14 Mar 2017 17:14:03 GMT (1275kb,D)
[v5] Fri, 21 Jul 2017 09:17:29 GMT (1278kb,D)
[v6] Tue, 17 Apr 2018 12:57:03 GMT (1589kb,D)