Xue Bao-Xin, Barbatti Mario, Dral Pavlo O
State Key Laboratory of Physical Chemistry of Solid Surfaces, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Department of Chemistry, and College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, P. R. China.
Aix Marseille University, CNRS, ICR, Marseille, France.
J Phys Chem A. 2020 Sep 3;124(35):7199-7210. doi: 10.1021/acs.jpca.0c05310. Epub 2020 Aug 25.
We present a machine learning (ML) method to accelerate the nuclear ensemble approach (NEA) for computing absorption cross sections. ML-NEA is used to calculate cross sections on vast ensembles of nuclear geometries to reduce the error due to insufficient statistical sampling. The electronic properties-excitation energies and oscillator strengths-are calculated with a reference electronic structure method only for a relatively few points in the ensemble. The KREG model (kernel-ridge-regression-based ML combined with the RE descriptor) as implemented in MLatom is used to predict these properties for the remaining tens of thousands of points in the ensemble without incurring much of additional computational cost. We demonstrate for two examples, benzene and a 9-dicyanomethylene derivative of acridine, that ML-NEA can produce statistically converged cross sections even for very challenging cases and even with as few as several hundreds of training points.
我们提出了一种机器学习(ML)方法,以加速用于计算吸收截面的核系综方法(NEA)。ML-NEA用于计算大量核几何构型系综上的截面,以减少由于统计采样不足而产生的误差。电子性质——激发能和振子强度——仅使用参考电子结构方法对系综中相对较少的点进行计算。MLatom中实现的KREG模型(基于核岭回归的机器学习与RE描述符相结合)用于预测系综中其余数万个点的这些性质,而不会产生太多额外的计算成本。我们以苯和吖啶的9-二氰基亚甲基衍生物这两个例子证明,即使对于极具挑战性的情况,甚至只有几百个训练点,ML-NEA也能产生统计收敛的截面。