We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

cs.LG

Change to browse by:

References & Citations

DBLP - CS Bibliography

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Quantitative Biology > Biomolecules

Title: Hierarchical, rotation-equivariant neural networks to select structural models of protein complexes

Abstract: Predicting the structure of multi-protein complexes is a grand challenge in biochemistry, with major implications for basic science and drug discovery. Computational structure prediction methods generally leverage pre-defined structural features to distinguish accurate structural models from less accurate ones. This raises the question of whether it is possible to learn characteristics of accurate models directly from atomic coordinates of protein complexes, with no prior assumptions. Here we introduce a machine learning method that learns directly from the 3D positions of all atoms to identify accurate models of protein complexes, without using any pre-computed physics-inspired or statistical terms. Our neural network architecture combines multiple ingredients that together enable end-to-end learning from molecular structures containing tens of thousands of atoms: a point-based representation of atoms, equivariance with respect to rotation and translation, local convolutions, and hierarchical subsampling operations. When used in combination with previously developed scoring functions, our network substantially improves the identification of accurate structural models among a large set of possible models. Our network can also be used to predict the accuracy of a given structural model in absolute terms. The architecture we present is readily applicable to other tasks involving learning on 3D structures of large atomic systems.
Comments: 11 pages, 5 figures + SI: Updated based on the published version in PROTEINS. Presented at NeurIPS 2019 workshop Learning Meaningful Representations of Life
Subjects: Biomolecules (q-bio.BM); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
DOI: 10.1002/prot.26033
Cite as: arXiv:2006.09275 [q-bio.BM]
  (or arXiv:2006.09275v2 [q-bio.BM] for this version)

Submission history

From: Stephan Eismann [view email]
[v1] Fri, 5 Jun 2020 20:17:12 GMT (1649kb,D)
[v2] Sat, 23 Jan 2021 00:47:10 GMT (5716kb,D)

Link back to: arXiv, form interface, contact.