Department of Pharmaceutical Sciences, University of California, Irvine, California 92697, USA.
Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122, USA.
J Chem Phys. 2022 Apr 7;156(13):134115. doi: 10.1063/5.0088024.
Accurate and efficient simulation of the thermodynamics and kinetics of protein-ligand interactions is crucial for computational drug discovery. Multiensemble Markov Model (MEMM) estimators can provide estimates of both binding rates and affinities from collections of short trajectories but have not been systematically explored for situations when a ligand is decoupled through scaling of non-bonded interactions. In this work, we compare the performance of two MEMM approaches for estimating ligand binding affinities and rates: (1) the transition-based reweighting analysis method (TRAM) and (2) a Maximum Caliber (MaxCal) based method. As a test system, we construct a small host-guest system where the ligand is a single uncharged Lennard-Jones (LJ) particle, and the receptor is an 11-particle icosahedral pocket made from the same atom type. To realistically mimic a protein-ligand binding system, the LJ ϵ parameter was tuned, and the system was placed in a periodic box with 860 TIP3P water molecules. A benchmark was performed using over 80 µs of unbiased simulation, and an 18-state Markov state model was used to estimate reference binding affinities and rates. We then tested the performance of TRAM and MaxCal when challenged with limited data. Both TRAM and MaxCal approaches perform better than conventional Markov state models, with TRAM showing better convergence and accuracy. We find that subsampling of trajectories to remove time correlation improves the accuracy of both TRAM and MaxCal and that in most cases, only a single biased ensemble to enhance sampled transitions is required to make accurate estimates.
准确高效地模拟蛋白质-配体相互作用的热力学和动力学对于计算药物发现至关重要。多集 Markov 模型 (MEMM) 估计器可以从短轨迹集合中提供结合速率和亲和力的估计,但尚未系统地探索在通过非键相互作用缩放使配体解耦的情况下的性能。在这项工作中,我们比较了两种用于估计配体结合亲和力和速率的 MEMM 方法的性能:(1)基于跃迁的重加权分析方法 (TRAM) 和 (2) 基于最大口径 (MaxCal) 的方法。作为测试系统,我们构建了一个小分子-大环主体系统,其中配体是单个不带电荷的 Lennard-Jones (LJ) 粒子,受体是由相同原子类型制成的 11 个粒子二十面体口袋。为了真实模拟蛋白质-配体结合系统,我们调整了 LJ ϵ 参数,并将系统置于带有 860 个 TIP3P 水分子的周期性盒子中。使用超过 80 µs 的无偏模拟进行了基准测试,并使用 18 态 Markov 状态模型来估计参考结合亲和力和速率。然后,我们在面临有限数据时测试了 TRAM 和 MaxCal 的性能。TRAM 和 MaxCal 方法都比传统的 Markov 状态模型表现更好,TRAM 显示出更好的收敛性和准确性。我们发现,轨迹的子采样以去除时间相关性可以提高 TRAM 和 MaxCal 的准确性,并且在大多数情况下,仅需要一个增强采样跃迁的有偏集合即可进行准确估计。