Centre for Computational Science, Department of Chemistry, University College London, London WC1H 0AJ, U.K.
Advanced Research Computing Centre, University College London, London WC1H 0AJ, U.K.
J Chem Theory Comput. 2023 Nov 14;19(21):7846-7860. doi: 10.1021/acs.jctc.3c00842. Epub 2023 Oct 20.
Alchemical relative binding free energy calculations have recently found important applications in drug optimization. A series of congeneric compounds are generated from a preidentified lead compound, and their relative binding affinities to a protein are assessed in order to optimize candidate drugs. While methods based on equilibrium thermodynamics have been extensively studied, an approach based on nonequilibrium methods has recently been reported together with claims of its superiority. However, these claims pay insufficient attention to the basis and reliability of both methods. Here we report a comparative study of the two approaches across a large data set, comprising more than 500 ligand transformations spanning in excess of 300 ligands binding to a set of 14 diverse protein targets. Ensemble methods are essential to quantify the uncertainty in these calculations, not only for the reasons already established in the equilibrium approach but also to ensure that the nonequilibrium calculations reside within their domain of validity. If and only if ensemble methods are applied, we find that the nonequilibrium method can achieve accuracy and precision comparable to those of the equilibrium approach. Compared to the equilibrium method, the nonequilibrium approach can reduce computational costs but introduces higher computational complexity and longer wall clock times. There are, however, cases where the standard length of a nonequilibrium transition is not sufficient, necessitating a complete rerun of the entire set of transitions. This significantly increases the computational cost and proves to be highly inconvenient during large-scale applications. Our findings provide a key set of recommendations that should be adopted for the reliable implementation of nonequilibrium approaches to relative binding free energy calculations in ligand-protein systems.
最近,化学相对结合自由能计算在药物优化中得到了重要的应用。从预先确定的先导化合物生成一系列同系物化合物,并评估它们与蛋白质的相对结合亲和力,以优化候选药物。虽然基于平衡热力学的方法已经得到了广泛的研究,但最近报道了一种基于非平衡方法的方法,并声称其具有优越性。然而,这些说法并没有充分关注这两种方法的基础和可靠性。在这里,我们报告了对这两种方法的大规模数据集的比较研究,该数据集包括超过 500 个配体转化,涵盖了超过 300 个配体与 14 个不同的蛋白质靶标结合。集合方法对于量化这些计算中的不确定性至关重要,这不仅是平衡方法中已经确立的原因,也是为了确保非平衡计算处于其有效范围内。只有在应用集合方法的情况下,我们才发现非平衡方法可以达到与平衡方法相当的准确性和精度。与平衡方法相比,非平衡方法可以降低计算成本,但会引入更高的计算复杂性和更长的运行时间。然而,在某些情况下,非平衡转换的标准长度是不够的,需要完全重新运行整个转换集。这会显著增加计算成本,并且在大规模应用中非常不方便。我们的研究结果提供了一套关键的建议,应该在配体-蛋白质系统中可靠地实施非平衡相对结合自由能计算方法。