Lau K F, Dill K A
Department of Pharmaceutical Chemistry, University of California, San Francisco 94143.
Proc Natl Acad Sci U S A. 1990 Jan;87(2):638-42. doi: 10.1073/pnas.87.2.638.
Using an elementary physical model for protein folding, of self-avoiding short copolymer chains on two-dimensional square lattices, we address two questions regarding the evolution and origins of globular proteins. (i) How will protein native structures and stabilities be affected by single-and double-site mutations? (ii) What is the probability that a randomly chosen sequence of amino acids will be compact and globular under folding conditions? For a large number of different sequences, we search the conformational space exhaustively to find unequivocally the "native" conformation(s), of global minimum free energy, for each sequence. We find that replacing nonpolar residues in the core by polar residues is generally destabilizing, that surface sites are less sensitive than core sites, that some mutations increase the degeneracy of native states, and that overall it is most probable that a mutation will be neutral, having no effect on the native structure. These results support a "Continuity Principle," that small changes in sequence seldom have large effects on structure or stability of the native state. The simulations also show that (i) the number of "convergent" sequences (different sequences coding for the same native structure) is extremely large and (ii) most sequences become quite dense under folding conditions. This implies that the probability of formation of a globular protein from a random sequence of amino acids by prebiotic or mutational methods is significantly greater than zero.
我们使用一个蛋白质折叠的基本物理模型,即二维正方晶格上的自回避短共聚物链,来探讨关于球状蛋白质的演化和起源的两个问题。(i)单一位点和双位点突变将如何影响蛋白质的天然结构和稳定性?(ii)在折叠条件下,随机选择的氨基酸序列形成紧密球状结构的概率是多少?对于大量不同的序列,我们详尽地搜索构象空间,以明确找到每个序列的全局最小自由能的“天然”构象。我们发现,用极性残基取代核心中的非极性残基通常会使结构不稳定,表面位点比核心位点更不敏感,一些突变会增加天然态的简并性,并且总体而言,最有可能的是突变是中性的,对天然结构没有影响。这些结果支持了“连续性原理”,即序列的小变化很少对天然态的结构或稳定性产生大的影响。模拟还表明:(i)“收敛”序列(编码相同天然结构的不同序列)的数量极大;(ii)大多数序列在折叠条件下会变得相当致密。这意味着通过益生元或突变方法由随机氨基酸序列形成球状蛋白质的概率显著大于零。