Suppr超能文献

羧基化柱[6]芳烃主客体结合中终点自由能技术的综合评价:II. 回归与介电常数

Comprehensive evaluation of end-point free energy techniques in carboxylated-pillar[6]arene host-guest binding: II. regression and dielectric constant.

作者信息

Liu Xiao, Zheng Lei, Cong Yalong, Gong Zhihao, Yin Zhixiang, Zhang John Z H, Liu Zhirong, Sun Zhaoxi

机构信息

School of Mathematics, Physics and Statistics, Shanghai University of Engineering Science, Shanghai, 201620, China.

NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai, 200062, China.

出版信息

J Comput Aided Mol Des. 2022 Dec;36(12):879-894. doi: 10.1007/s10822-022-00487-w. Epub 2022 Nov 17.

Abstract

End-point free energy calculations as a powerful tool have been widely applied in protein-ligand and protein-protein interactions. It is often recognized that these end-point techniques serve as an option of intermediate accuracy and computational cost compared with more rigorous statistical mechanic models (e.g., alchemical transformation) and coarser molecular docking. However, it is observed that this intermediate level of accuracy does not hold in relatively simple and prototypical host-guest systems. Specifically, in our previous work investigating a set of carboxylated-pillar[6]arene host-guest complexes, end-point methods provide free energy estimates deviating significantly from the experimental reference, and the rank of binding affinities is also incorrectly computed. These observations suggest the unsuitability and inapplicability of standard end-point free energy techniques in host-guest systems, and alteration and development are required to make them practically usable. In this work, we consider two ways to improve the performance of end-point techniques. The first one is the PBSA_E regression that varies the weights of different free energy terms in the end-point calculation procedure, while the second one is considering the interior dielectric constant as an additional variable in the end-point equation. By detailed investigation of the calculation procedure and the simulation outcome, we prove that these two treatments (i.e., regression and dielectric constant) are manipulating the end-point equation in a somehow similar way, i.e., weakening the electrostatic contribution and strengthening the non-polar terms, although there are still many detailed differences between these two methods. With the trained end-point scheme, the RMSE of the computed affinities is improved from the standard ~ 12 kcal/mol to ~ 2.4 kcal/mol, which is comparable to another altered end-point method (ELIE) trained with system-specific data. By tuning PBSA_E weighting factors with the host-specific data, it is possible to further decrease the prediction error to ~ 2.1 kcal/mol. These observations along with the extremely efficient optimized-structure computation procedure suggest the regression (i.e., PBSA_E as well as its GBSA_E extension) as a practically applicable solution that brings end-point methods back into the library of usable tools for host-guest binding. However, the dielectric-constant-variable scheme cannot effectively minimize the experiment-calculation discrepancy for absolute binding affinities, but is able to improve the calculation of affinity ranks. This phenomenon is somehow different from the protein-ligand case and suggests the difference between host-guest and biomacromolecular (protein-ligand and protein-protein) systems. Therefore, the spectrum of tools usable for protein-ligand complexes could be unsuitable for host-guest binding, and numerical validations are necessary to screen out really workable solutions in these 'prototypical' situations.

摘要

端点自由能计算作为一种强大的工具,已广泛应用于蛋白质 - 配体和蛋白质 - 蛋白质相互作用中。人们普遍认为,与更严格的统计力学模型(例如炼金术变换)和更粗糙的分子对接相比,这些端点技术是一种具有中等精度和计算成本的选择。然而,据观察,这种中等精度水平在相对简单和典型的主客体系统中并不成立。具体而言,在我们之前研究一组羧基化柱[6]芳烃主客体配合物的工作中,端点方法提供的自由能估计值与实验参考值有显著偏差,并且结合亲和力的排序也计算错误。这些观察结果表明标准端点自由能技术在主客体系统中不适用,需要进行改进和发展才能使其实际可用。在这项工作中,我们考虑了两种提高端点技术性能的方法。第一种是PBSA_E回归,它在端点计算过程中改变不同自由能项的权重,而第二种是将内部介电常数作为端点方程中的一个额外变量。通过对计算过程和模拟结果的详细研究,我们证明这两种处理方法(即回归和介电常数)以某种相似的方式操纵端点方程,即削弱静电贡献并加强非极性项,尽管这两种方法之间仍存在许多细节差异。使用经过训练的端点方案,计算亲和力的均方根误差从标准的约12千卡/摩尔提高到约2.4千卡/摩尔,这与另一种使用特定系统数据训练的改进端点方法(ELIE)相当。通过使用特定主体数据调整PBSA_E加权因子,可以进一步将预测误差降低到约2.1千卡/摩尔。这些观察结果以及极其高效的优化结构计算过程表明,回归方法(即PBSA_E及其GBSA_E扩展)是一种实际可行的解决方案,使端点方法重新成为可用于主客体结合的工具库。然而,介电常数可变方案不能有效地最小化绝对结合亲和力的实验 - 计算差异,但能够改善亲和力排序的计算。这种现象在某种程度上与蛋白质 - 配体情况不同,表明主客体系统与生物大分子(蛋白质 - 配体和蛋白质 - 蛋白质)系统之间存在差异。因此,可用于蛋白质 - 配体复合物的工具范围可能不适用于主客体结合,在这些“典型”情况下需要进行数值验证以筛选出真正可行的解决方案。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验