Zhang Jintu, Zhang Odin, Bonati Luigi, Hou TingJun
Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China.
Atomistic Simulations, Italian Institute of Technology, Genova 16152, Italy.
J Chem Theory Comput. 2024 Jun 11;20(11):4523-4532. doi: 10.1021/acs.jctc.4c00423. Epub 2024 May 27.
Rare event sampling is a central problem in modern computational chemistry research. Among the existing methods, transition path sampling (TPS) can generate unbiased representations of reaction processes. However, its efficiency depends on the ability to generate reactive trial paths, which in turn depends on the quality of the shooting algorithm used. We propose a new algorithm based on the shooting success rate, i.e., reactivity, measured as a function of a reduced set of collective variables (CVs). These variables are extracted with a machine learning approach directly from TPS simulations, using a multitask objective function. Iteratively, this workflow significantly improves the shooting efficiency without any prior knowledge of the process. In addition, the optimized CVs can be used with biased enhanced sampling methodologies to accurately reconstruct the free energy profiles. We tested the method on three different systems: a two-dimensional toy model, conformational transitions of alanine dipeptide, and hydrolysis of acetyl chloride in bulk water. In the latter, we integrated our workflow with an active learning scheme to learn a reactive machine learning-based potential, which allowed us to study the mechanism and free energy profile with an ab initio-like accuracy.
稀有事件采样是现代计算化学研究中的一个核心问题。在现有方法中,过渡路径采样(TPS)可以生成反应过程的无偏表示。然而,其效率取决于生成反应性试验路径的能力,而这又取决于所使用的射击算法的质量。我们提出了一种基于射击成功率(即反应性)的新算法,该成功率是作为一组简化的集体变量(CVs)的函数来衡量的。这些变量通过机器学习方法直接从TPS模拟中提取,使用多任务目标函数。迭代地,这种工作流程显著提高了射击效率,而无需对过程有任何先验知识。此外,优化后的CVs可与有偏增强采样方法一起使用,以准确重建自由能分布。我们在三个不同的系统上测试了该方法:一个二维玩具模型、丙氨酸二肽的构象转变以及乙酰氯在大量水中的水解。在后者中,我们将我们的工作流程与主动学习方案相结合,以学习基于反应性机器学习的势能,这使我们能够以类似从头算的精度研究反应机理和自由能分布。