We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

q-bio.QM

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Quantitative Biology > Quantitative Methods

Title: Deep generative models of genetic variation capture mutation effects

Abstract: The functions of proteins and RNAs are determined by a myriad of interactions between their constituent residues, but most quantitative models of how molecular phenotype depends on genotype must approximate this by simple additive effects. While recent models have relaxed this constraint to also account for pairwise interactions, these approaches do not provide a tractable path towards modeling higher-order dependencies. Here, we show how latent variable models with nonlinear dependencies can be applied to capture beyond-pairwise constraints in biomolecules. We present a new probabilistic model for sequence families, DeepSequence, that can predict the effects of mutations across a variety of deep mutational scanning experiments significantly better than site independent or pairwise models that are based on the same evolutionary data. The model, learned in an unsupervised manner solely from sequence information, is grounded with biologically motivated priors, reveals latent organization of sequence families, and can be used to extrapolate to new parts of sequence space
Subjects: Quantitative Methods (q-bio.QM); Disordered Systems and Neural Networks (cond-mat.dis-nn); Biological Physics (physics.bio-ph); Machine Learning (stat.ML)
Cite as: arXiv:1712.06527 [q-bio.QM]
  (or arXiv:1712.06527v1 [q-bio.QM] for this version)

Submission history

From: Debora Marks [view email]
[v1] Mon, 18 Dec 2017 17:13:08 GMT (4177kb)

Link back to: arXiv, form interface, contact.