Title: A Fast, Accurate Two-Step Linear Mixed Model for Genetic Analysis Applied to Repeat MRI Measurements

Abstract: Large-scale biobanks are being collected around the world to better understand human health and risk factors for disease. They often survey hundreds of thousands of individuals, combining questionnaires with clinical, genetic, demographic, and imaging assessments; some of these data may be collected longitudinally. Genetic association analysis of such datasets requires methods that properly handle relatedness, population structure, and other biases introduced by confounders. The most popular and accurate approaches rely on linear mixed model (LMM) algorithms, which are iterative; the computational complexity of each iteration scales with the square of the sample size. This slows the pace of discovery (a single-trait analysis can take up to several days) and limits the use of repeat phenotypic measurements. Here, we describe a new, non-iterative, much faster, and accurate Two-Step Linear Mixed Model (Two-Step LMM) approach whose computational complexity scales linearly with sample size. We show that the first step retains accurate estimates of the heritability (the proportion of the trait variance explained by additive genetic factors), even when increasingly complex genetic relationships between individuals are modeled. The second step provides a faster framework for obtaining the effect sizes of covariates in the regression model. We applied Two-Step LMM to real data from the UK Biobank, which recently released genotyping information and processed MRI data from 9,725 individuals. We used the left and right hippocampus volume (HV) as repeated measures and observed higher and more accurate heritability estimates, consistent with simulations.
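
For readers unfamiliar with the terminology, the heritability defined in the abstract can be written out using the textbook variance-components form of a linear mixed model. This is a sketch of the standard formulation, not necessarily the exact parameterization used in the paper. All symbols are assumptions introduced here for illustration: y is the phenotype vector for n individuals, X the covariate matrix, β the covariate effect sizes, K an n × n genetic relationship matrix scaled so that its mean diagonal entry is 1, and σ_g², σ_e² the additive genetic and residual variances.

```latex
\[
\begin{aligned}
y &= X\beta + g + \varepsilon, \\
g &\sim \mathcal{N}\!\left(0,\ \sigma_g^2 K\right), \qquad
\varepsilon \sim \mathcal{N}\!\left(0,\ \sigma_e^2 I_n\right), \\
h^2 &= \frac{\sigma_g^2}{\sigma_g^2 + \sigma_e^2}.
\end{aligned}
\]
```

In this notation, h² is the heritability and β holds the covariate effect sizes that the abstract refers to. Because K is n × n, fitting such a model typically involves operations whose cost grows quickly with sample size.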
Comments: 2017 Neural Information Processing Systems (NeurIPS) BigNeuro Workshop
Subjects: Quantitative Methods (q-bio.QM); Applications (stat.AP)
Cite as: arXiv:1710.10641 [q-bio.QM]
  (or arXiv:1710.10641v4 [q-bio.QM] for this version)

Submission history

From: Qifan Yang
[v1] Sun, 29 Oct 2017 16:24:40 GMT (953kb)
[v2] Thu, 7 Dec 2017 13:07:09 GMT (953kb)
[v3] Fri, 15 Feb 2019 01:42:45 GMT (2613kb)
[v4] Fri, 15 Mar 2019 19:01:43 GMT (2640kb)
