Current browse context:
physics.chem-ph
Change to browse by:
References & Citations
Physics > Chemical Physics
Title: Selected Machine Learning of HOMO-LUMO gaps with Improved Data-Efficiency
(Submitted on 6 Oct 2021)
Abstract: Quantum Machine Learning (QML) models of molecular HOMO-LUMO-gaps often struggle to achieve satisfying data-efficiency as measured by decreasing prediction errors for increasing training set sizes. Partitioning training sets of organic molecules (QM7 and QM9-data-sets) into three classes [systems containing either aromatic rings and carbonyl groups, or single unsaturated bonds, or saturated bonds] prior to training results in independently trained QML models with improved learning rates. The selected QML models of band-gaps (at GW, B3LYP, and ZINDO level of theory) reach mean absolute prediction errors of $\sim$0.1 eV for up to an order of magnitude fewer training molecules than for conventionally trained models. Direct comparison to $\Delta$-QML models of band-gaps suggest that selected QML possesses substantially more data-efficiency. The findings suggest that selected QML, e.g. based on simple classifications prior to training, could help to successfully tackle challenging quantum property screening tasks of large libraries with high fidelity and low computational burden.
Link back to: arXiv, form interface, contact.