Krupa Paweł, Sieradzan Adam K, Rackovsky S, Baranowski Maciej, Ołldziej Stanisław, Scheraga Harold A, Liwo Adam, Czaplewski Cezary
Faculty of Chemistry, University of Gdańsk, Wita Stwosza 63, 80-952 Gdańsk, Poland.
Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, N.Y., 14853-1301, U.S.A.
J Chem Theory Comput. 2013 Oct 8;9(10). doi: 10.1021/ct4004977.
The UNited RESidue (UNRES) coarse-grained model of polypeptide chains, developed in our laboratory, enables us to carry out millisecond-scale molecular-dynamics simulations of large proteins effectively. It performs well in predictions of protein structure, as demonstrated in the last Community Wide Experiment on the Critical Assessment of Techniques for Protein Structure Prediction (CASP10). However, the resolution of the simulated structure is too coarse, especially in loop regions, which results from insufficient specificity of the model of local interactions. To improve the representation of local interactions, in this work we introduced new side-chain-backbone correlation potentials, derived from a statistical analysis of loop regions of 4585 proteins. To obtain sufficient statistics, we reduced the set of amino-acid-residue types to five groups, derived in our earlier work on structurally optimized reduced alphabets, based on a statistical analysis of the properties of amino-acid structures. The new correlation potentials are expressed as one-dimensional Fourier series in the virtual-bond-dihedral angles involving side-chain centroids. The weight of these new terms was determined by a trial-and-error method, in which Multiplexed Replica Exchange Molecular Dynamics (MREMD) simulations were run on selected test proteins. The best average root-mean-square deviations (RMSDs) of the calculated structures from the experimental structures below the folding-transition temperatures were obtained with the weight of the new side-chain-backbone correlation potentials equal to 0.57. The resulting conformational ensembles were analyzed in detail by using the Weighted Histogram Analysis Method (WHAM) and Ward's minimum-variance clustering. This analysis showed that the RMSDs from the experimental structures dropped by 0.5 Å on average, compared to simulations without the new terms, and the deviation of individual residues in the loop region of the computed structures from their counterparts in the experimental structures (after optimum superposition of the calculated and experimental structure) decreased by up to 8 Å. Consequently, the new terms improve the representation of local structure.
我们实验室开发的多肽链联合残基(UNRES)粗粒度模型,使我们能够有效地对大型蛋白质进行毫秒级分子动力学模拟。如上次蛋白质结构预测技术关键评估社区范围实验(CASP10)所示,它在蛋白质结构预测方面表现良好。然而,模拟结构的分辨率太粗糙,尤其是在环区,这是由于局部相互作用模型的特异性不足所致。为了改进局部相互作用的表示,在这项工作中,我们引入了新的侧链-主链相关势,该势源自对4585种蛋白质环区的统计分析。为了获得足够的统计数据,我们基于对氨基酸结构性质的统计分析,将氨基酸残基类型集减少到五组,这是我们早期在结构优化简化字母表的工作中得出的。新的相关势在涉及侧链质心的虚拟键二面角中表示为一维傅里叶级数。这些新项的权重通过试错法确定,其中对选定的测试蛋白质进行了多重副本交换分子动力学(MREMD)模拟。当新的侧链-主链相关势的权重等于0.57时,计算结构与实验结构在折叠转变温度以下的最佳平均均方根偏差(RMSD)得以获得。使用加权直方图分析方法(WHAM)和沃德最小方差聚类对所得构象集合进行了详细分析。该分析表明,与没有新项的模拟相比,与实验结构的RMSD平均下降了0.5 Å,并且计算结构环区中单个残基与其在实验结构中的对应残基(在计算结构和实验结构进行最佳叠加之后)的偏差减少了多达8 Å。因此,新项改进了局部结构的表示。