Department of Theoretical Chemistry, Lund University, SE-221 00 Lund, Sweden.
J Comput Chem. 2010 Mar;31(4):837-46. doi: 10.1002/jcc.21366.
The molecular mechanics/generalized Born surface area (MM/GBSA) method has been investigated with the aim of achieving a statistical precision of 1 kJ/mol for the results. We studied the binding of seven biotin analogues to avidin, taking advantage of the fact that the protein is a tetramer with four independent binding sites, which should give the same estimated binding affinities. We show that it is not enough to use a single long simulation (10 ns), because the standard error of such a calculation underestimates the difference between the four binding sites. Instead, it is better to run several independent simulations and average the results. With such an approach, we obtain the same results for the four binding sites, and any desired precision can be obtained by running a proper number of simulations. We discuss how the simulations should be performed to optimize the use of computer time. The correlation time between the MM/GBSA energies is approximately 5 ps and an equilibration time of 100 ps is needed. For MM/GBSA, we recommend a sampling time of 20-200 ps for each separate simulation, depending on the protein. With 200 ps production time, 5-50 separate simulations are required to reach a statistical precision of 1 kJ/mol (800-8000 energy calculations or 1.5-15 ns total simulation time per ligand) for the seven avidin ligands. This is an order of magnitude more than what is normally used, but such a number of simulations is needed to obtain statistically valid results for the MM/GBSA method.
我们研究了七种生物素类似物与亲和素的结合,利用蛋白质是具有四个独立结合位点的四聚体这一事实,这应该给出相同的估计结合亲和力。我们表明,仅仅使用单个长模拟(10 ns)是不够的,因为这种计算的标准误差低估了四个结合位点之间的差异。相反,最好运行几个独立的模拟并平均结果。通过这种方法,我们为四个结合位点获得了相同的结果,并且可以通过运行适当数量的模拟来获得任何所需的精度。我们讨论了如何执行模拟以优化计算机时间的使用。MM/GBSA 能量之间的相关时间约为 5 ps,需要 100 ps 的平衡时间。对于 MM/GBSA,我们建议对于每个单独的模拟,采样时间为 20-200 ps,具体取决于蛋白质。对于 200 ps 的生产时间,对于七个亲和素配体,需要 5-50 个单独的模拟才能达到 1 kJ/mol 的统计精度(每个配体 800-8000 个能量计算或总共 1.5-15 ns 的模拟时间)。这比通常使用的数量级要多得多,但对于 MM/GBSA 方法,需要如此多的模拟才能获得统计上有效的结果。