Suppr超能文献

SAMPL6抽样挑战:评估结合自由能计算的可靠性和效率。

The SAMPL6 SAMPLing challenge: assessing the reliability and efficiency of binding free energy calculations.

作者信息

Rizzi Andrea, Jensen Travis, Slochower David R, Aldeghi Matteo, Gapsys Vytautas, Ntekoumes Dimitris, Bosisio Stefano, Papadourakis Michail, Henriksen Niel M, de Groot Bert L, Cournia Zoe, Dickson Alex, Michel Julien, Gilson Michael K, Shirts Michael R, Mobley David L, Chodera John D

机构信息

Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA.

Tri-Institutional Training Program in Computational Biology and Medicine, New York, NY, 10065, USA.

出版信息

J Comput Aided Mol Des. 2020 May;34(5):601-633. doi: 10.1007/s10822-020-00290-5. Epub 2020 Jan 27.

Abstract

Approaches for computing small molecule binding free energies based on molecular simulations are now regularly being employed by academic and industry practitioners to study receptor-ligand systems and prioritize the synthesis of small molecules for ligand design. Given the variety of methods and implementations available, it is natural to ask how the convergence rates and final predictions of these methods compare. In this study, we describe the concept and results for the SAMPL6 SAMPLing challenge, the first challenge from the SAMPL series focusing on the assessment of convergence properties and reproducibility of binding free energy methodologies. We provided parameter files, partial charges, and multiple initial geometries for two octa-acid (OA) and one cucurbit[8]uril (CB8) host-guest systems. Participants submitted binding free energy predictions as a function of the number of force and energy evaluations for seven different alchemical and physical-pathway (i.e., potential of mean force and weighted ensemble of trajectories) methodologies implemented with the GROMACS, AMBER, NAMD, or OpenMM simulation engines. To rank the methods, we developed an efficiency statistic based on bias and variance of the free energy estimates. For the two small OA binders, the free energy estimates computed with alchemical and potential of mean force approaches show relatively similar variance and bias as a function of the number of energy/force evaluations, with the attach-pull-release (APR), GROMACS expanded ensemble, and NAMD double decoupling submissions obtaining the greatest efficiency. The differences between the methods increase when analyzing the CB8-quinine system, where both the guest size and correlation times for system dynamics are greater. For this system, nonequilibrium switching (GROMACS/NS-DS/SB) obtained the overall highest efficiency. Surprisingly, the results suggest that specifying force field parameters and partial charges is insufficient to generally ensure reproducibility, and we observe differences between seemingly converged predictions ranging approximately from 0.3 to 1.0 kcal/mol, even with almost identical simulations parameters and system setup (e.g., Lennard-Jones cutoff, ionic composition). Further work will be required to completely identify the exact source of these discrepancies. Among the conclusions emerging from the data, we found that Hamiltonian replica exchange-while displaying very small variance-can be affected by a slowly-decaying bias that depends on the initial population of the replicas, that bidirectional estimators are significantly more efficient than unidirectional estimators for nonequilibrium free energy calculations for systems considered, and that the Berendsen barostat introduces non-negligible artifacts in expanded ensemble simulations.

摘要

基于分子模拟计算小分子结合自由能的方法如今正被学术界和工业界的从业者广泛用于研究受体 - 配体系统,并为配体设计确定小分子合成的优先级。鉴于现有多种方法和实现方式,自然而然会问这些方法的收敛速度和最终预测结果如何比较。在本研究中,我们描述了SAMPL6采样挑战的概念和结果,这是SAMPL系列中的首个挑战,重点在于评估结合自由能方法的收敛特性和可重复性。我们为两个八酸(OA)和一个葫芦[8]脲(CB8)主客体系统提供了参数文件、部分电荷和多个初始几何结构。参与者针对使用GROMACS、AMBER、NAMD或OpenMM模拟引擎实现的七种不同的炼金术和物理路径(即平均力势和轨迹加权系综)方法,提交了作为力和能量评估次数函数的结合自由能预测。为了对这些方法进行排名,我们基于自由能估计的偏差和方差开发了一种效率统计量。对于两个较小的OA结合物,用炼金术方法和平均力势方法计算得到的自由能估计值,作为能量/力评估次数的函数,显示出相对相似的方差和偏差,其中附着 - 拉动 - 释放(APR)、GROMACS扩展系综和NAMD双重解耦提交的方法效率最高。在分析CB8 - 奎宁系统时,方法之间的差异增大,该系统中客体尺寸和系统动力学的相关时间都更大。对于这个系统,非平衡切换(GROMACS/NS - DS/SB)总体效率最高。令人惊讶的是,结果表明指定力场参数和部分电荷通常不足以确保可重复性,并且我们观察到即使模拟参数和系统设置几乎相同(例如, Lennard - Jones截断、离子组成),看似收敛的预测结果之间仍存在约0.3至1.0千卡/摩尔的差异。需要进一步开展工作以完全确定这些差异的确切来源。从数据中得出的结论包括,我们发现哈密顿量副本交换虽然方差非常小,但可能会受到一种缓慢衰减偏差的影响该偏差取决于副本的初始分布;对于所考虑的系统,双向估计器在非平衡自由能计算中比单向估计器效率显著更高;并且Berendsen压力调节器在扩展系综模拟中会引入不可忽略的伪影。

相似文献

2
Overview of the SAMPL6 host-guest binding affinity prediction challenge.SAMPL6 主客体结合亲和力预测挑战概述。
J Comput Aided Mol Des. 2018 Oct;32(10):937-963. doi: 10.1007/s10822-018-0170-6. Epub 2018 Nov 10.
3
SAMPL6 host-guest challenge: binding free energies via a multistep approach.SAMPL6 主客体挑战:通过多步骤方法计算结合自由能。
J Comput Aided Mol Des. 2018 Oct;32(10):1097-1115. doi: 10.1007/s10822-018-0159-1. Epub 2018 Sep 17.

引用本文的文献

4
Automated Adaptive Absolute Binding Free Energy Calculations.自动自适应绝对结合自由能计算
J Chem Theory Comput. 2024 Sep 10;20(18):7806-28. doi: 10.1021/acs.jctc.4c00806.

本文引用的文献

4
8
Overview of the SAMPL6 host-guest binding affinity prediction challenge.SAMPL6 主客体结合亲和力预测挑战概述。
J Comput Aided Mol Des. 2018 Oct;32(10):937-963. doi: 10.1007/s10822-018-0170-6. Epub 2018 Nov 10.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验