Choutka Jan, Kaminský Jakub, Wang Ercheng, Parkan Kamil, Pohl Radek
Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Gilead Sciences & IOCB Research Centre, Flemingovo nám. 2, 166 10 Prague, Czech Republic.
Zhejiang Laboratory, Hangzhou 311100, China.
J Chem Inf Model. 2025 Jan 27;65(2):762-777. doi: 10.1021/acs.jcim.4c01659. Epub 2025 Jan 4.
The use of quantum mechanical potentials in protein-ligand affinity prediction is becoming increasingly feasible with growing computational power. To move forward, validation of such potentials on real-world challenges is necessary. To this end, we have collated an extensive set of over a thousand galectin inhibitors with known affinities and docked them into galectin-3. The docked poses were then used to systematically evaluate several modern force fields and semiempirical quantum mechanical (SQM) methods up to the tight-binding level under consistent computational workflow. Implicit solvation models available with the tested methods were used to simulate solvation effects. Overall, the best methods in this study achieved a Pearson correlation of 0.7-0.8 between the computed and experimental affinities. There were differences between the tested methods in their ability to rank ligands across the entire ligand set as well as within subsets of structurally similar ligands. A major discrepancy was observed for a subset of ligands that bind to the protein via a halogen bond, which was clearly challenging for all the tested methods. The inclusion of an entropic term calculated by the rigid-rotor-harmonic-oscillator approximation at SQM level slightly worsened correlation with experiment but brought the calculated affinities closer to experimental values. We also found that the success of the prediction strongly depended on the solvation model. Furthermore, we provide an in-depth analysis of the individual energy terms and their effect on the overall prediction accuracy.
随着计算能力的不断提高,在蛋白质-配体亲和力预测中使用量子力学势变得越来越可行。为了取得进展,有必要在实际挑战中对这种势进行验证。为此,我们整理了一组广泛的、超过一千种具有已知亲和力的半乳糖凝集素抑制剂,并将它们对接至半乳糖凝集素-3。然后,利用对接构象在一致的计算工作流程下系统地评估了几种现代力场和直至紧束缚水平的半经验量子力学(SQM)方法。使用测试方法中可用的隐式溶剂化模型来模拟溶剂化效应。总体而言,本研究中最佳的方法在计算亲和力和实验亲和力之间达到了0.7 - 0.8的皮尔逊相关系数。在对整个配体集以及结构相似配体子集中的配体进行排序的能力方面,测试方法之间存在差异。对于通过卤素键与蛋白质结合的一部分配体,观察到了一个主要差异,这对所有测试方法来说显然都具有挑战性。在SQM水平上通过刚性转子-谐振子近似计算的熵项的纳入,略微降低了与实验的相关性,但使计算出的亲和力更接近实验值。我们还发现预测的成功强烈依赖于溶剂化模型。此外,我们对各个能量项及其对整体预测准确性的影响进行了深入分析。