Raval Alpan, Piana Stefano, Eastwood Michael P, Shaw David E
D. E. Shaw Research, New York, New York, 10036.
Department of Biochemistry and Molecular Biophysics, Columbia University, New York, New York, 10032.
Protein Sci. 2016 Jan;25(1):19-29. doi: 10.1002/pro.2770. Epub 2015 Aug 30.
Molecular dynamics (MD) simulation is a well-established tool for the computational study of protein structure and dynamics, but its application to the important problem of protein structure prediction remains challenging, in part because extremely long timescales can be required to reach the native structure. Here, we examine the extent to which the use of low-resolution information in the form of residue-residue contacts, which can often be inferred from bioinformatics or experimental studies, can accelerate the determination of protein structure in simulation. We incorporated sets of 62, 31, or 15 contact-based restraints in MD simulations of ubiquitin, a benchmark system known to fold to the native state on the millisecond timescale in unrestrained simulations. One-third of the restrained simulations folded to the native state within a few tens of microseconds-a speedup of over an order of magnitude compared with unrestrained simulations and a demonstration of the potential for limited amounts of structural information to accelerate structure determination. Almost all of the remaining ubiquitin simulations reached near-native conformations within a few tens of microseconds, but remained trapped there, apparently due to the restraints. We discuss potential methodological improvements that would facilitate escape from these near-native traps and allow more simulations to quickly reach the native state. Finally, using a target from the Critical Assessment of protein Structure Prediction (CASP) experiment, we show that distance restraints can improve simulation accuracy: In our simulations, restraints stabilized the native state of the protein, enabling a reasonable structural model to be inferred.
分子动力学(MD)模拟是用于蛋白质结构和动力学计算研究的成熟工具,但其在蛋白质结构预测这一重要问题上的应用仍具有挑战性,部分原因是达到天然结构可能需要极长的时间尺度。在这里,我们研究了以残基-残基接触形式的低分辨率信息(通常可从生物信息学或实验研究中推断得出)在多大程度上能够加速模拟中蛋白质结构的确定。我们在泛素的MD模拟中纳入了62、31或15组基于接触的约束条件,泛素是一个基准系统,在无约束模拟中已知其在毫秒时间尺度上折叠成天然状态。三分之一的受约束模拟在几十微秒内折叠成天然状态——与无约束模拟相比加速了一个数量级以上,证明了有限数量的结构信息有加速结构确定的潜力。几乎所有其余的泛素模拟在几十微秒内达到了接近天然的构象,但显然由于约束条件而被困在那里。我们讨论了可能有助于从这些接近天然的陷阱中逃脱并使更多模拟快速达到天然状态的方法学改进。最后,使用蛋白质结构预测关键评估(CASP)实验中的一个目标,我们表明距离约束可以提高模拟准确性:在我们的模拟中,约束条件稳定了蛋白质的天然状态,从而能够推断出合理的结构模型。