Division of National Supercomputing, Korea Institute of Science and Technology Information, 245 Daehak-ro, Yuseong-gu, 34141, Daejeon, Republic of Korea.
Nat Commun. 2023 Mar 1;14(1):1168. doi: 10.1038/s41467-023-36823-3.
The elucidation of transition state (TS) structures is essential for understanding the mechanisms of chemical reactions and exploring reaction networks. Despite significant advances in computational approaches, TS searching remains a challenging problem owing to the difficulty of constructing an initial structure and heavy computational costs. In this paper, a machine learning (ML) model for predicting the TS structures of general organic reactions is proposed. The proposed model derives the interatomic distances of a TS structure from atomic pair features reflecting reactant, product, and linearly interpolated structures. The model exhibits excellent accuracy, particularly for atomic pairs in which bond formation or breakage occurs. The predicted TS structures yield a high success ratio (93.8%) for quantum chemical saddle point optimizations, and 88.8% of the optimization results have energy errors of less than 0.1 kcal mol. Additionally, as a proof of concept, the exploration of multiple reaction paths of an organic reaction is demonstrated based on ML inferences. I envision that the proposed approach will aid in the construction of initial geometries for TS optimization and reaction path exploration.
阐明过渡态(TS)结构对于理解化学反应机制和探索反应网络至关重要。尽管在计算方法方面取得了重大进展,但由于难以构建初始结构和计算成本高昂,TS 搜索仍然是一个具有挑战性的问题。本文提出了一种用于预测一般有机反应 TS 结构的机器学习(ML)模型。该模型从原子对特征中导出 TS 结构的原子间距离,这些特征反映了反应物、产物和线性插值结构。该模型具有出色的准确性,特别是对于发生键形成或断裂的原子对。预测的 TS 结构在量子化学鞍点优化中具有很高的成功率(93.8%),并且 88.8%的优化结果的能量误差小于 0.1 kcal mol。此外,作为概念验证,本文基于 ML 推理演示了有机反应的多条反应路径的探索。我设想,所提出的方法将有助于构建 TS 优化和反应路径探索的初始几何形状。