Klimov D K, Thirumalai D
Institute for Physical Science and Technology, University of Maryland, College Park, MD 20742, USA.
J Mol Biol. 1998 Sep 18;282(2):471-92. doi: 10.1006/jmbi.1998.1997.
The nature of the nucleation-collapse mechanism in protein folding is probed using 27-mer and 36-mer lattice models. Three different forms for the interaction potentials are used. Three of the four 27-mer sequences have maximally compact and identical native state while the other has a non-compact native conformation. All the sequences fold thermodynamically and kinetically by a two-state process. Analysis of individual trajectories for each sequence using a self-organizing neural net algorithm shows that upon formation of a critical set of contacts the polypeptide chain rapidly reaches the native conformation which is consistent with a nucleation-collapse mechanism. The algorithm, which reduces the identification of the folding nucleus for each trajectory to one of pattern recognition, is used to show that there are multiple folding nuclei. There is a distribution of nucleation contacts in the transition states with some of them occurring with more probability (when averaged over the denatured ensemble) than others. We also show that there is a distribution in the size of the nuclei with the average number of residues in the folding nuclei being less than about one-third of the chain size. The fluctuations in the sizes of the nuclei are large, suggestive of a broad transition region. The folding nuclei, the structures of each are the corresponding transition states, have varying degree of overlap with the native conformation. The distribution of the radius of gyration of the transition states shows that these structures are an expanded form (by about 25% in the radius of gyration) of the native conformation. Local contacts are most dominant in the folding nuclei while a certain fraction of non-local contacts is necessary to stabilize the transition states. The search for the critical nuclei initially involves the formation of local contacts, while non-local contacts are formed later. The fractional values of PhiF for the two 27-mer mutants found by using the protein engineering protocol are consistent with the microscopic picture of partial formation of structures involving these residues in the transition state. These observations lead to a multiple folding nuclei (MFN) model for nucleation-collapse mechanism in protein folding. The major implication of the MFN model is that, even if the residues whose tertiary interactions are formed nearly completely in the transition state are mutated, it does not disrupt the nature of the nucleation-collapse mechanism. We analyze the experiments on chymotrypsin inhibitor 2 and alpha-spectrin SH3 domain and two circular permutants in light of the MFN model. It is shown that the PhiF-value analysis for these proteins gives considerable support to the MFN model. The theoretical and experimental studies give a coherent picture of the nucleation-collapse mechanism in which there is a distribution of folding nuclei with some more probable than others. The formation of any specific nucleus is not necessary for efficient two-state folding.
利用27聚体和36聚体晶格模型探究蛋白质折叠中核化 - 塌缩机制的本质。使用了三种不同形式的相互作用势。四个27聚体序列中的三个具有最大紧凑且相同的天然态,而另一个具有非紧凑的天然构象。所有序列在热力学和动力学上都通过双态过程折叠。使用自组织神经网络算法对每个序列的个体轨迹进行分析表明,在形成一组关键接触后,多肽链迅速达到天然构象,这与核化 - 塌缩机制一致。该算法将每个轨迹的折叠核识别简化为模式识别之一,用于表明存在多个折叠核。在过渡态中核化接触存在分布,其中一些接触出现的概率更高(当在变性系综上平均时)。我们还表明,核的大小存在分布,折叠核中平均残基数量小于链大小的约三分之一。核大小的波动很大,表明存在一个宽泛的过渡区域。折叠核,其每一个的结构都是相应的过渡态,与天然构象有不同程度的重叠。过渡态回转半径的分布表明这些结构是天然构象的一种扩展形式(回转半径增加约25%)。局部接触在折叠核中最为占主导地位,而一定比例的非局部接触对于稳定过渡态是必要的。对关键核的搜索最初涉及局部接触的形成,而非局部接触稍后形成。通过蛋白质工程方案发现的两个27聚体突变体的PhiF分数值与过渡态中涉及这些残基的结构部分形成的微观图景一致。这些观察结果导致了蛋白质折叠中核化 - 塌缩机制的多折叠核(MFN)模型。MFN模型的主要含义是,即使在过渡态中三级相互作用几乎完全形成的残基发生突变,也不会破坏核化 - 塌缩机制的本质。我们根据MFN模型分析了胰凝乳蛋白酶抑制剂2和α - 血影蛋白SH3结构域以及两个环形置换体的实验。结果表明,对这些蛋白质的PhiF值分析为MFN模型提供了相当大的支持。理论和实验研究给出了核化 - 塌缩机制的连贯图景,其中折叠核存在分布,一些比另一些更有可能。对于高效的双态折叠,形成任何特定的核并非必要。