Title: Molecular Graph Convolutions: Moving Beyond Fingerprints

Abstract: Molecular "fingerprints" encoding structural information are the workhorse of cheminformatics and machine learning in drug discovery applications. However, fingerprint representations necessarily emphasize particular aspects of the molecular structure while ignoring others, rather than allowing the model to make data-driven decisions. We describe molecular graph convolutions, a machine learning architecture for learning from undirected graphs, specifically small molecules. Graph convolutions use a simple encoding of the molecular graph (atoms, bonds, distances, etc.), which allows the model to take greater advantage of information in the graph structure. Although graph convolutions do not outperform all fingerprint-based methods, they (along with other graph-based methods) represent a new paradigm in ligand-based virtual screening with exciting opportunities for future improvement.
Comments: Changed cross-validation scheme to use a held-out validation set and made other changes in response to reviewer comments, such as including comparisons to additional models and adding more background for the methods. Under review by the Journal of Computer-Aided Molecular Design (JCAMD)
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as: arXiv:1603.00856 [stat.ML]
  (or arXiv:1603.00856v2 [stat.ML] for this version)

Submission history

From: Steven Kearnes
[v1] Wed, 2 Mar 2016 20:34:08 GMT (855kb,D)
[v2] Mon, 1 Aug 2016 22:26:57 GMT (905kb,D)
[v3] Thu, 18 Aug 2016 17:17:05 GMT (907kb,D)
