Bystrom Kyle, Falletta Stefano, Kozinsky Boris
Harvard John A. Paulson School of Engineering and Applied Sciences, Cambridge, Massachusetts 02138, United States.
Robert Bosch LLC Research and Technology Center, Cambridge, Massachusetts 02139, United States.
J Chem Theory Comput. 2024 Sep 10;20(17):7516-7532. doi: 10.1021/acs.jctc.4c00999. Epub 2024 Aug 23.
The systematic underestimation of band gaps is one of the most fundamental challenges in semilocal density functional theory (DFT). In addition to hindering the application of DFT to predicting electronic properties, the band gap problem is intimately related to self-interaction and delocalization errors, which make the study of charge transfer mechanisms with DFT difficult. To expand the range of available tools for addressing the band gap problem, we design an approach for machine learning density functionals based on Gaussian processes to explicitly fit single-particle energy levels. We also introduce nonlocal features of the density matrix that are expressive enough to fit these single-particle levels. Combining these developments, we train a machine-learned functional for the exact exchange energy that predicts molecular energy gaps and reaction energies of a wide range of molecules in excellent agreement with reference hybrid DFT calculations. In addition, while being trained solely on molecular data, our model predicts reasonable formation energies of polarons in solids, showcasing its transferability and robustness. We discuss how this approach can be generalized to full exchange-correlation functionals, thus paving the way to the design of state-of-the-art functionals for the prediction of electronic properties of molecules and materials.
能带隙的系统性低估是半局域密度泛函理论(DFT)中最基本的挑战之一。除了阻碍DFT在预测电子性质方面的应用外,能带隙问题还与自相互作用和离域误差密切相关,这使得用DFT研究电荷转移机制变得困难。为了扩展解决能带隙问题的可用工具范围,我们设计了一种基于高斯过程的机器学习密度泛函方法,以明确拟合单粒子能级。我们还引入了密度矩阵的非局部特征,这些特征足以表达以拟合这些单粒子能级。结合这些进展,我们训练了一个用于精确交换能的机器学习泛函,该泛函能预测多种分子的分子能隙和反应能,与参考杂化DFT计算结果高度吻合。此外,虽然我们的模型仅在分子数据上进行训练,但它能预测固体中极化子的合理形成能,展示了其可转移性和鲁棒性。我们讨论了如何将这种方法推广到完整的交换关联泛函,从而为设计用于预测分子和材料电子性质的先进泛函铺平道路。