Forouzesh Negin, Ghafouri Fatemeh, Tolokh Igor S, Onufriev Alexey V
Department of Computer Science, California State University, Los Angeles, California 90032, United States.
Genetics, Bioinformatics, and Computational Biology, Virginia Polytechnic Institute & State University, Blacksburg, Virginia 24061, United States.
J Chem Inf Model. 2024 Dec 23;64(24):9433-9448. doi: 10.1021/acs.jcim.4c01190. Epub 2024 Dec 10.
Accuracy of binding free energy calculations utilizing implicit solvent models is critically affected by parameters of the underlying dielectric boundary, specifically, the atomic and water probe radii. Here, a multidimensional optimization pipeline is used to find optimal atomic radii, specifically for binding calculations in the implicit solvent. To reduce overfitting, the optimization target includes separate, weighted contributions from both binding and hydration free energies. The resulting five-parameter radii set, OPT_BIND5D, is evaluated against experiment for binding free energies of 20 host-guest (H-G) systems, unrelated to the types of structures used in the training. The resulting accuracy for this H-G test set (root mean square error of 2.03 kcal/mol, mean signed error of -0.13 kcal/mol, mean absolute error of 1.68 kcal/mol, and Pearson's correlation of = 0.79 with the experimental values) is on par with what can be expected from the fixed charge explicit solvent models. Best agreement with the experiment is achieved when the implicit salt concentration is set equal or close to the experimental conditions.
利用隐式溶剂模型进行结合自由能计算的准确性受到潜在介电边界参数的严重影响,特别是原子和水探针半径。在此,使用多维优化管道来寻找最佳原子半径,特别是用于隐式溶剂中的结合计算。为了减少过拟合,优化目标包括结合自由能和水化自由能的单独加权贡献。所得的五参数半径集OPT_BIND5D针对20个主客体(H-G)系统的结合自由能进行了实验评估,这些系统与训练中使用的结构类型无关。该H-G测试集的所得准确性(均方根误差为2.03 kcal/mol,平均符号误差为-0.13 kcal/mol,平均绝对误差为1.68 kcal/mol,与实验值的皮尔逊相关系数为0.79)与固定电荷显式溶剂模型的预期相当。当隐式盐浓度设置为等于或接近实验条件时,与实验的一致性最佳。