Current browse context:
physics.chem-ph
Change to browse by:
References & Citations
Physics > Chemical Physics
Title: Training Algorithm Matters for the Performance of Neural Network Potential: A Case Study of Adam and the Kalman Filter Optimizers
(Submitted on 8 Sep 2021 (v1), last revised 9 Nov 2021 (this version, v3))
Abstract: One hidden yet important issue for developing neural network potentials (NNPs) is the choice of training algorithm. Here we compare the performance of two popular training algorithms, the adaptive moment estimation algorithm (Adam) and the Extended Kalman Filter algorithm (EKF), using the Behler-Parrinello neural network (BPNN) and two publicly accessible datasets of liquid water [Proc. Natl. Acad. Sci. U.S.A. 2016, 113, 8368-8373 and Proc. Natl. Acad. Sci. U.S.A. 2019, 116, 1110-1115]. This is achieved by implementing EKF in TensorFlow. It is found that NNPs trained with EKF are more transferable and less sensitive to the value of the learning rate, as compared to Adam. In both cases, error metrics of the validation set do not always serve as a good indicator for the actual performance of NNPs. Instead, we show that their performance correlates well with a Fisher information based similarity measure.
Submission history
From: Chao Zhang Dr. [view email][v1] Wed, 8 Sep 2021 16:48:33 GMT (1796kb,D)
[v2] Mon, 25 Oct 2021 11:22:03 GMT (1235kb,D)
[v3] Tue, 9 Nov 2021 15:46:37 GMT (2303kb,D)
Link back to: arXiv, form interface, contact.