Yao Songyuan, Van Richard, Pan Xiaoliang, Park Ji Hwan, Mao Yuezhi, Pu Jingzhi, Mei Ye, Shao Yihan
Department of Chemistry and Biochemistry, University of Oklahoma Norman OK 73019 USA
School of Computer Science, University of Oklahoma Norman OK 73019 USA.
RSC Adv. 2023 Feb 3;13(7):4565-4577. doi: 10.1039/d2ra08180f. eCollection 2023 Jan 31.
Inspired by the recent work from Noé and coworkers on the development of machine learning based implicit solvent model for the simulation of solvated peptides [Chen , , 2021, , 084101], here we report another investigation of the possibility of using machine learning (ML) techniques to "derive" an implicit solvent model directly from explicit solvent molecular dynamics (MD) simulations. For alanine dipeptide, a machine learning potential (MLP) based on the DeepPot-SE representation of the molecule was trained to capture its interactions with its average solvent environment configuration (ASEC). The predicted forces on the solute deviated only by an RMSD of 0.4 kcal mol Å from the reference values, and the MLP-based free energy surface differed from that obtained from explicit solvent MD simulations by an RMSD of less than 0.9 kcal mol. Our MLP training protocol could also accurately reproduce combined quantum mechanical molecular mechanical (QM/MM) forces on the quantum mechanical (QM) solute in ASEC environment, thus enabling the development of accurate ML-based implicit solvent models for -QM MD simulations. Such ML-based implicit solvent models for QM calculations are cost-effective in both the training stage, where the use of ASEC reduces the number of data points to be labelled, and the inference stage, where the MLP can be evaluated at a relatively small additional cost on top of the QM calculation of the solute.
受诺埃及其同事最近关于开发基于机器学习的隐式溶剂模型以模拟溶剂化肽的工作启发[陈,,2021,,084101],在此我们报告另一项关于使用机器学习(ML)技术直接从显式溶剂分子动力学(MD)模拟“推导”隐式溶剂模型可能性的研究。对于丙氨酸二肽,基于分子的DeepPot-SE表示训练了一种机器学习势(MLP),以捕捉其与平均溶剂环境构型(ASEC)的相互作用。溶质上预测的力与参考值的均方根偏差(RMSD)仅为0.4 kcal mol Å,基于MLP的自由能表面与从显式溶剂MD模拟获得的自由能表面的RMSD小于0.9 kcal mol。我们的MLP训练协议还可以在ASEC环境中准确再现量子力学溶质上的组合量子力学分子力学(QM/MM)力,从而能够为-QM MD模拟开发准确的基于ML的隐式溶剂模型。这种用于QM计算的基于ML的隐式溶剂模型在训练阶段(使用ASEC减少了需要标记的数据点数量)和推理阶段(在溶质的QM计算之上,可以以相对较小的额外成本评估MLP)都具有成本效益。