We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

physics.chem-ph

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Physics > Chemical Physics

Title: Selected Machine Learning of HOMO-LUMO gaps with Improved Data-Efficiency

Abstract: Quantum Machine Learning (QML) models of molecular HOMO-LUMO-gaps often struggle to achieve satisfying data-efficiency as measured by decreasing prediction errors for increasing training set sizes. Partitioning training sets of organic molecules (QM7 and QM9-data-sets) into three classes [systems containing either aromatic rings and carbonyl groups, or single unsaturated bonds, or saturated bonds] prior to training results in independently trained QML models with improved learning rates. The selected QML models of band-gaps (at GW, B3LYP, and ZINDO level of theory) reach mean absolute prediction errors of $\sim$0.1 eV for up to an order of magnitude fewer training molecules than for conventionally trained models. Direct comparison to $\Delta$-QML models of band-gaps suggest that selected QML possesses substantially more data-efficiency. The findings suggest that selected QML, e.g. based on simple classifications prior to training, could help to successfully tackle challenging quantum property screening tasks of large libraries with high fidelity and low computational burden.
Comments: 19 pages, 20 figures
Subjects: Chemical Physics (physics.chem-ph)
Cite as: arXiv:2110.02596 [physics.chem-ph]
  (or arXiv:2110.02596v1 [physics.chem-ph] for this version)

Submission history

From: Bernard Mazouin [view email]
[v1] Wed, 6 Oct 2021 09:03:33 GMT (3389kb,D)

Link back to: arXiv, form interface, contact.