Ouyang Zheng, Liang Jie
Department of Bioengineering, University of Illinois at Chicago, Chicago, Illinois 60607, USA.
Protein Sci. 2008 Jul;17(7):1256-63. doi: 10.1110/ps.034660.108. Epub 2008 Apr 23.
Protein folding speeds are known to vary over more than eight orders of magnitude. Plaxco, Simons, and Baker (see References) first showed a correlation of folding speed with the topology of the native protein. That and subsequent studies showed, if the native structure of a protein is known, its folding speed can be predicted reasonably well through a correlation with the "localness" of the contacts in the protein. In the present work, we develop a related measure, the geometric contact number, N (alpha), which is the number of nonlocal contacts that are well-packed, by a Voronoi criterion. We find, first, that in 80 proteins, the largest such database of proteins yet studied, N (alpha) is a consistently excellent predictor of folding speeds of both two-state fast folders and more complex multistate folders. Second, we show that folding rates can also be predicted from amino acid sequences directly, without the need to know the native topology or other structural properties.
已知蛋白质折叠速度的变化范围超过八个数量级。普拉克斯科、西蒙斯和贝克(见参考文献)首先揭示了折叠速度与天然蛋白质拓扑结构之间的相关性。该项研究及后续研究表明,如果已知蛋白质的天然结构,那么通过与蛋白质中接触的“局部性”建立相关性,就能够较为准确地预测其折叠速度。在本研究中,我们开发了一种相关度量——几何接触数N(α),它是根据Voronoi准则确定的紧密堆积的非局部接触的数量。首先,我们发现,在80种蛋白质(这是目前研究过的最大的蛋白质数据库)中,N(α)始终是两态快速折叠蛋白和更复杂的多态折叠蛋白折叠速度的出色预测指标。其次,我们表明,无需知道天然拓扑结构或其他结构特性,仅从氨基酸序列也可以预测折叠速率。