Molecule Attention Transformer

Maziarka, Łukasz; Danel, Tomasz; Mucha, Sławomir; Rataj, Krzysztof; Tabor, Jacek; Jastrzębski, Stanisław

Computer Science > Machine Learning

(Submitted on 19 Feb 2020)

Abstract: Designing a single neural network architecture that performs competitively across a range of molecule property prediction tasks remains largely an open challenge, and its solution may unlock a widespread use of deep learning in the drug discovery industry. To move towards this goal, we propose Molecule Attention Transformer (MAT). Our key innovation is to augment the attention mechanism in Transformer using inter-atomic distances and the molecular graph structure. Experiments show that MAT performs competitively on a diverse set of molecular prediction tasks. Most importantly, with a simple self-supervised pretraining, MAT requires tuning of only a few hyperparameter values to achieve state-of-the-art performance on downstream tasks. Finally, we show that attention weights learned by MAT are interpretable from the chemical point of view.
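The abstract describes the key idea only in words: self-attention is augmented with inter-atomic distances and the molecular graph. The sketch below is one plausible reading of that idea, not the authors' implementation. It assumes the three sources are combined as a weighted sum of (a) ordinary scaled dot-product attention, (b) a row-wise softmax over negative distances, and (c) the raw adjacency matrix. The function name `molecule_attention`, the mixing weights `w_attn`, `w_dist`, `w_graph`, and the toy three-atom inputs are all hypothetical and chosen for illustration.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    e = np.exp(shifted)
    return e / e.sum(axis=axis, keepdims=True)


def molecule_attention(q, k, v, dist, adj,
                       w_attn=0.4, w_dist=0.3, w_graph=0.3):
    """Single-head attention mixed with distance and graph terms.

    Assumed form (hypothetical, for illustration only):
        mix = w_attn * softmax(q k^T / sqrt(d_k))
            + w_dist * softmax(-dist)
            + w_graph * adj
        out = mix @ v
    """
    d_k = q.shape[-1]
    attn_term = softmax(q @ k.T / np.sqrt(d_k))  # standard self-attention
    dist_term = softmax(-dist)                   # closer atoms weigh more
    mix = w_attn * attn_term + w_dist * dist_term + w_graph * adj
    return mix @ v, mix


# Toy input: a three-atom chain (atom 0 - atom 1 - atom 2).
rng = np.random.default_rng(0)
n_atoms, d_model = 3, 4
x = rng.normal(size=(n_atoms, d_model))          # made-up atom features
dist = np.array([[0.0, 1.5, 2.5],
                 [1.5, 0.0, 1.5],
                 [2.5, 1.5, 0.0]])               # made-up distances
adj = np.array([[0.0, 1.0, 0.0],
                [1.0, 0.0, 1.0],
                [0.0, 1.0, 0.0]])                # bonds of the chain

out, mix = molecule_attention(x, x, x, dist, adj)
print(out.shape, mix.shape)
```

In this sketch the adjacency term is left unnormalized, so atoms with more bonds receive a larger total weight. Whether to normalize it, and how to set or learn the mixing weights, are design choices that the abstract does not specify.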

Subjects:	Machine Learning (cs.LG); Computational Physics (physics.comp-ph); Machine Learning (stat.ML)
Journal reference:	Graph Representation Learning workshop and Machine Learning and the Physical Sciences workshop at NeurIPS 2019
Cite as:	arXiv:2002.08264 [cs.LG]
	(or arXiv:2002.08264v1 [cs.LG] for this version)

Submission history

From: Łukasz Maziarka [view email]
[v1] Wed, 19 Feb 2020 16:14:48 GMT (287kb,D)
