Centre for Computational Science, Department of Chemistry, University College London, London, WC1H 0AJ, United Kingdom.
J Chem Inf Model. 2010 May 24;50(5):890-905. doi: 10.1021/ci100007w.
Accurate calculation of important thermodynamic properties, such as macromolecular binding free energies, is one of the principal goals of molecular dynamics simulations. However, single long simulation frequently produces incorrectly converged quantitative results due to inadequate sampling of conformational space in a feasible wall-clock time. Multiple short (ensemble) simulations have been shown to explore conformational space more effectively than single long simulations, but the two methods have not yet been thermodynamically compared. Here we show that, for end-state binding free energy determination methods, ensemble simulations exhibit significantly enhanced thermodynamic sampling over single long simulations and result in accurate and converged relative binding free energies that are reproducible to within 0.5 kcal/mol. Completely correct ranking is obtained for six HIV-1 protease variants bound to lopinavir with a correlation coefficient of 0.89 and a mean relative deviation from experiment of 0.9 kcal/mol. Multidrug resistance to lopinavir is enthalpically driven and increases through a decrease in the protein-ligand van der Waals interaction, principally due to the V82A/I84V mutation, and an increase in net electrostatic repulsion due to water-mediated disruption of protein-ligand interactions in the catalytic region. Furthermore, we correctly rank, to within 1 kcal/mol of experiment, the substantially increased chemical potency of lopinavir binding to the wild-type protease compared to saquinavir and show that lopinavir takes advantage of a decreased net electrostatic repulsion to confer enhanced binding. Our approach is dependent on the combined use of petascale computing resources and on an automated simulation workflow to attain the required level of sampling and turn around time to obtain the results, which can be as little as three days. This level of performance promotes integration of such methodology with clinical decision support systems for the optimization of patient-specific therapy.
准确计算重要的热力学性质,如大分子结合自由能,是分子动力学模拟的主要目标之一。然而,由于在可行的时钟时间内对构象空间的采样不足,单次长模拟经常产生不正确收敛的定量结果。已经证明,多个短(集合)模拟比单个长模拟更有效地探索构象空间,但这两种方法尚未在热力学上进行比较。在这里,我们表明,对于终态结合自由能测定方法,集合模拟在热力学上显著增强了对单个长模拟的采样,从而产生准确且收敛的相对结合自由能,其重现性在 0.5 千卡/摩尔以内。对于与洛匹那韦结合的六种 HIV-1 蛋白酶变体,我们获得了完全正确的排序,相关系数为 0.89,与实验的平均相对偏差为 0.9 千卡/摩尔。对洛匹那韦的多药耐药性是焓驱动的,通过降低蛋白质-配体范德华相互作用而增加,主要是由于 V82A/I84V 突变,以及由于水介导的催化区域中蛋白质-配体相互作用的破坏而增加的净静电排斥。此外,我们正确地对洛匹那韦与野生型蛋白酶结合的化学效力相对于沙奎那韦提高了 1 千卡/摩尔以内进行了排序,并表明洛匹那韦利用降低的净静电排斥来赋予增强的结合。我们的方法依赖于使用 petascale 计算资源和自动化模拟工作流程来达到所需的采样水平和周转时间以获得结果,这可能只需要三天。这种性能水平促进了将这种方法与临床决策支持系统集成,以优化患者特定的治疗。