Klimov Dmitri K, Thirumalai D
Bioinformatics and Computational Biology Program, School of Computational Sciences, George Mason University, Manassas, VA 20110, USA.
J Mol Biol. 2005 Nov 11;353(5):1171-86. doi: 10.1016/j.jmb.2005.09.029. Epub 2005 Sep 29.
The influence of native connectivity of secondary structure elements (SSE) on folding is studied using coarse-grained models of proteins with mixed alpha and beta structure and the analysis of the structural database of wild-type proteins. We found that the distribution of SSE along a sequence determines the diversity of folding pathways. If alpha and beta SSE are localized in different parts of a sequence, the diversity of folding pathways is restricted. An even (symmetric) distribution of alpha and beta SSE with respect to sequence midpoint favors multiple folding routes. Simulations are supplemented by the database analysis of the distribution of SSE in wild-type protein sequences. On an average, two-thirds of wild-type proteins with mixed alpha and beta structure have symmetric distribution of alpha and beta SSE. The propensity for symmetric distribution of SSE is especially evident for large proteins with the number of SSE > or = 10. We suggest that symmetric SSE distribution in protein sequences may arise due to nearly random allocation of alpha and beta structure along wild-type sequences. The tendency of long sequences to misfold is perhaps compensated by the enhanced pathway diversity. In addition, folding pathways are shown to progress via hierarchic assembly of SSE in accordance with their proximity along a sequence. We demonstrate that under mild denaturation conditions folding and unfolding pathways are similar. However, the reversibility of folding/unfolding pathways is shown to depend on the distribution of SSE. If alpha and beta SSE are localized in different parts of a sequence, folding and unfolding pathways are likely to coincide.
利用具有混合α和β结构的蛋白质粗粒度模型以及野生型蛋白质结构数据库分析,研究了二级结构元件(SSE)的天然连接性对折叠的影响。我们发现SSE沿序列的分布决定了折叠途径的多样性。如果α和β SSE位于序列的不同部分,则折叠途径的多样性会受到限制。α和β SSE相对于序列中点的均匀(对称)分布有利于多种折叠途径。通过对野生型蛋白质序列中SSE分布的数据库分析对模拟进行了补充。平均而言,具有混合α和β结构的野生型蛋白质中有三分之二具有α和β SSE的对称分布。对于SSE数量≥10的大蛋白质,SSE对称分布的倾向尤为明显。我们认为蛋白质序列中SSE的对称分布可能是由于α和β结构沿野生型序列的近乎随机分配而产生的。长序列错误折叠的趋势可能通过增强的途径多样性得到补偿。此外,折叠途径显示是通过SSE根据其在序列中的接近程度进行分层组装而进行的。我们证明在温和变性条件下,折叠和展开途径是相似的。然而,折叠/展开途径的可逆性显示取决于SSE的分布。如果α和β SSE位于序列的不同部分,折叠和展开途径可能会重合。