Bremen Center for Computational Materials Science, University of Bremen, 28359 Bremen, Germany.
Warwick Centre for Predictive Modelling, School of Engineering, University of Warwick, Coventry CV4 7AL, United Kingdom.
J Chem Theory Comput. 2023 Jul 11;19(13):3877-3888. doi: 10.1021/acs.jctc.3c00152. Epub 2023 Jun 23.
Density functional tight binding (DFTB) is an approximate density functional based quantum chemical simulation method with low computational cost. In order to increase its accuracy, we have introduced a machine learning algorithm to optimize several parameters of the DFTB method, concentrating on solids with defects. The backpropagation algorithm was used to reduce the error between DFTB and DFT results with respect to the training data set and to obtain adjusted DFTB Hamiltonian and overlap matrix elements. Afterward, the generalization capability of the trained model was tested for geometries not being part of the training set. In the current work, we have focused on defective periodic silicon and silicon carbide systems as target materials and the density of states (DOS) as target property to demonstrate the feasibility of our approach. The trained model was able to reduce the differences between the DFTB and DFT DOS significantly, while other derived properties (for example, Mulliken population distribution, projected DOS) remained physically sound. Also, the transferability of the obtained model could be verified. Our method allows to carry out relatively fast simulations with high accuracy and only moderate training efforts, and represents a good compromise for cases, where long-range effects make direct machine learning predictions difficult.
密度泛函紧束缚(DFTB)是一种具有低计算成本的近似密度泛函的量子化学模拟方法。为了提高其准确性,我们引入了机器学习算法来优化 DFTB 方法的几个参数,重点是具有缺陷的固体。反向传播算法用于减小 DFTB 与 DFT 结果之间相对于训练数据集的误差,并获得调整后的 DFTB 哈密顿量和重叠矩阵元素。之后,测试了经过训练的模型对不属于训练集的几何形状的泛化能力。在当前的工作中,我们专注于有缺陷的周期性硅和碳化硅系统作为目标材料,以及态密度(DOS)作为目标特性,以证明我们方法的可行性。训练后的模型能够显著减小 DFTB 和 DFT DOS 之间的差异,而其他衍生特性(例如,Mulliken 布居分布、投影 DOS)仍然具有物理意义。此外,还可以验证获得的模型的可转移性。我们的方法允许以高精度进行相对快速的模拟,并且仅需要适度的训练工作,对于那些由于远程效应使得直接进行机器学习预测变得困难的情况,这是一个很好的折衷方案。