Bhati Agastya P, Wan Shunzhou, Coveney Peter V
Centre for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, United Kingdom.
Computational Science Laboratory, Institute for Informatics, Faculty of Science, University of Amsterdam, Amsterdam 1012, The Netherlands.
J Chem Theory Comput. 2025 Jan 14;21(1):440-462. doi: 10.1021/acs.jctc.4c01389. Epub 2024 Dec 16.
Free energy calculations for protein-ligand complexes have become widespread in recent years owing to several conceptual, methodological and technological advances. Central among these is the use of ensemble methods which permits accurate, precise and reproducible predictions and is necessary for uncertainty quantification. Absolute binding free energies (ABFEs) are challenging to predict using alchemical methods and their routine application in drug discovery has remained out of reach until now. Here, we apply ensemble alchemical ABFE methods to a large data set comprising 219 ligand-protein complexes and obtain statistically robust results with high accuracy (<1 kcal/mol). We compare equilibrium and nonequilibrium methods for ABFE predictions at large scale and provide a systematic critical assessment of each method. The equilibrium method is more accurate, precise, faster, computationally more cost-effective and requires a much simpler protocol, making it preferable for large scale and blind applications. We find that the calculated free energy distributions are non-normal and discuss the consequences. We recommend a definitive protocol to perform ABFE calculations optimally. Using this protocol, it is possible to perform thousands of ABFE calculations within a few hours on modern exascale machines.
近年来,由于在概念、方法和技术方面取得了多项进展,蛋白质-配体复合物的自由能计算已变得十分普遍。其中核心的进展是使用系综方法,该方法能够进行准确、精确且可重复的预测,并且对于不确定性量化是必要的。使用炼金术方法预测绝对结合自由能(ABFE)具有挑战性,直到现在其在药物发现中的常规应用仍难以实现。在此,我们将系综炼金术ABFE方法应用于包含219个配体-蛋白质复合物的大数据集,并获得了具有高精度(<1千卡/摩尔)的统计稳健结果。我们在大规模上比较了用于ABFE预测的平衡和非平衡方法,并对每种方法进行了系统的批判性评估。平衡方法更准确、精确、快速,计算成本效益更高,并且需要的方案更简单,因此更适合大规模和盲目应用。我们发现计算出的自由能分布是非正态的,并讨论了其后果。我们推荐一种明确的方案以最佳地进行ABFE计算。使用该方案,在现代百亿亿次计算机上几小时内就可以进行数千次ABFE计算。