Busk Jonas, Schmidt Mikkel N, Winther Ole, Vegge Tejs, Jørgensen Peter Bjørn
Department of Energy Conversion and Storage, Technical University of Denmark, Kongens Lyngby, Denmark.
Department of Applied Mathematics and Computer Science, Technical University of Denmark, Kongens Lyngby, Denmark.
Phys Chem Chem Phys. 2023 Sep 27;25(37):25828-25837. doi: 10.1039/d3cp02143b.
Inexpensive machine learning (ML) potentials are increasingly being used to speed up structural optimization and molecular dynamics simulations of materials by iteratively predicting and applying interatomic forces. In these settings, it is crucial to detect when predictions are unreliable to avoid wrong or misleading results. Here, we present a complete framework for training and recalibrating graph neural network ensemble models to produce accurate predictions of energy and forces with calibrated uncertainty estimates. The proposed method considers both epistemic and aleatoric uncertainty and the total uncertainties are recalibrated using a nonlinear scaling function to achieve good calibration on previously unseen data, without loss of predictive accuracy. The method is demonstrated and evaluated on two challenging, publicly available datasets, ANI-1x (Smith , 2018, , 241733.) and Transition1x (Schreiner , 2022, , 779.), both containing diverse conformations far from equilibrium. A detailed analysis of the predictive performance and uncertainty calibration is provided. In all experiments, the proposed method achieved low prediction error and good uncertainty calibration, with predicted uncertainty correlating with expected error, on energy and forces. To the best of our knowledge, the method presented in this paper is the first to consider a complete framework for obtaining calibrated epistemic and aleatoric uncertainty predictions on both energy and forces in ML potentials.
成本低廉的机器学习(ML)势能正越来越多地用于通过迭代预测和应用原子间力来加速材料的结构优化和分子动力学模拟。在这些情况下,检测预测何时不可靠以避免错误或误导性结果至关重要。在此,我们提出了一个完整的框架,用于训练和重新校准图神经网络集成模型,以通过校准后的不确定性估计来准确预测能量和力。所提出的方法同时考虑了认知不确定性和偶然不确定性,并且使用非线性缩放函数对总不确定性进行重新校准,以便在先前未见过的数据上实现良好的校准,同时不损失预测准确性。该方法在两个具有挑战性的公开可用数据集ANI - 1x(史密斯,2018,,241733.)和Transition1x(施赖纳,2022,,779.)上进行了演示和评估,这两个数据集都包含远离平衡的各种构象。提供了对预测性能和不确定性校准的详细分析。在所有实验中,所提出的方法在能量和力方面均实现了低预测误差和良好的不确定性校准,预测的不确定性与预期误差相关。据我们所知,本文提出的方法是首个考虑用于在ML势能中获得关于能量和力的校准后的认知和偶然不确定性预测的完整框架。