Gönnheimer Nils, Reuter Karsten, Margraf Johannes T
Bavarian Center for Battery Technology (BayBatt), University of Bayreuth, Bayreuth 95448, Germany.
Fritz Haber Institute of the Max Planck Society, Berlin 14195, Germany.
J Chem Theory Comput. 2025 May 13;21(9):4742-4752. doi: 10.1021/acs.jctc.4c01790. Epub 2025 Apr 24.
The development of machine learning interatomic potentials (MLIPs) has revolutionized computational chemistry by enhancing the accuracy of empirical force fields while retaining a large computational speed-up compared to first-principles calculations. Despite these advancements, the calculation of Hessian matrices for large systems remains challenging, in particular because analytical second-order derivatives are often not implemented. This necessitates the use of computationally expensive finite-difference methods, which can furthermore display low precision in some cases. Automatic differentiation (AD) offers a promising alternative to reduce this computational effort and makes the calculation of Hessian matrices more efficient and accurate. Here, we present the implementation of AD-based second-order derivatives for the popular MACE equivariant graph neural network architecture. The benefits of this method are showcased via a high-throughput prediction of heat capacities of porous materials with the MACE-MP-0 foundation model. This is essential for precisely describing gas adsorption in these systems and was previously possible only with bespoke ML models or expensive first-principles calculations. We find that the availability of foundation models and accurate analytical Hessian matrices offers comparable accuracy to bespoke ML models in a zero-shot manner and additionally allows for the investigation of finite-size and rounding errors in the first-principles data.
机器学习原子间势(MLIPs)的发展彻底改变了计算化学,它提高了经验力场的准确性,同时与第一性原理计算相比,保持了大幅的计算加速。尽管有这些进展,但对于大型系统的海森矩阵计算仍然具有挑战性,特别是因为通常没有实现解析二阶导数。这就需要使用计算成本高昂的有限差分方法,而且在某些情况下这些方法的精度可能较低。自动微分(AD)提供了一种有前景的替代方法来减少这种计算量,并使海森矩阵的计算更高效、准确。在这里,我们展示了针对流行的MACE等变图神经网络架构基于AD的二阶导数的实现。通过使用MACE-MP-0基础模型对多孔材料的热容进行高通量预测,展示了该方法的优势。这对于精确描述这些系统中的气体吸附至关重要,而此前只有通过定制的机器学习模型或昂贵的第一性原理计算才能实现。我们发现,基础模型和精确的解析海森矩阵的可用性以零样本的方式提供了与定制机器学习模型相当的准确性,并且还允许研究第一性原理数据中的有限尺寸和舍入误差。