Pakula A A, Sauer R T
Division of Biology, California Institute of Technology, Pasadena 91125.
Annu Rev Genet. 1989;23:289-310. doi: 10.1146/annurev.ge.23.120189.001445.
There is tremendous variability in the importance of individual amino acids in protein sequences. On the one hand, nonconservative residue substitutions can be tolerated with no loss of activity at many residue positions, especially those exposed on the protein surface. On the other hand, destabilizing mutations can occur at a large number of different sites in a protein, and for many proteins such mutations account for more than half of the randomly isolated missense mutations that confer a defective phenotype. At sites that are key determinants of stability or activity, even residue substitutions that are generally considered to be conservative (e.g., Glu in equilibrium Asp, Asn in equilibrium Asp, Ile in equilibrium Leu, Lys in equilibrium Arg and Ala in equilibrium Gly) can have severe phenotypic effects. Unfortunately, this means that there is no simple way to infer the likely effect of an amino acid substitution on the basis of sequence information alone. A nonconservative Gly----Arg substitution could be phenotypically silent at one position while a conservative Asn----Asp change could lead to complete loss of activity at another position. For proteins whose structures are known, it is often possible to predict whether particular residue substitutions will be destabilizing, as long as detailed estimates of the destabilization energy are not required. Substitutions that introduce polar groups, large cavities, or overly large side chains into the hydrophobic core are potentially the most destabilizing. Substitutions that disrupt hydrogen bonding or electrostatic interactions can also have significant effects, although the destabilization caused by these substitutions is smaller than that caused by severe core mutations. Destabilizing substitutions that involve replacing glycines in turns, or introducing prolines into alpha-helices and other disallowed positions are also reasonably common. Finally, most solvent exposed residues can apparently be freely substituted without serious effects on protein stability. Although exceptions may occur, these generalizations serve to summarize a large body of information and can be rationalized in physical and chemical terms. It is an especially encouraging result that proteins appear to tolerate most substitutions, even those that are destabilizing, without significant changes in the native structure. For proteins whose structures are known, this means that it is reasonable to interpret mutant phenotypes in terms of the wild-type structure. For proteins whose structures are not known, it is reasonable to infer that mutations that reduce activity without affecting stability are directly involved in function.(ABSTRACT TRUNCATED AT 400 WORDS)
蛋白质序列中单个氨基酸的重要性存在巨大差异。一方面,在许多残基位置,尤其是蛋白质表面暴露的位置,非保守性残基替换可以被容忍而不丧失活性。另一方面,不稳定突变可发生在蛋白质的大量不同位点,对于许多蛋白质而言,此类突变在导致缺陷表型的随机分离错义突变中占比超过一半。在作为稳定性或活性关键决定因素的位点,即使是通常被认为是保守的残基替换(例如,平衡状态下的Glu替换Asp、Asn替换Asp、Ile替换Leu、Lys替换Arg以及Ala替换Gly)也可能产生严重的表型效应。不幸的是,这意味着仅根据序列信息没有简单的方法来推断氨基酸替换可能产生的影响。一个非保守的Gly→Arg替换在一个位置可能在表型上没有影响,而一个保守的Asn→Asp变化在另一个位置可能导致活性完全丧失。对于结构已知的蛋白质,只要不需要对去稳定化能量进行详细估计,通常就有可能预测特定的残基替换是否会导致不稳定。向疏水核心引入极性基团、大的空洞或过大的侧链的替换可能是最不稳定的。破坏氢键或静电相互作用的替换也可能有显著影响,尽管这些替换引起的不稳定比严重的核心突变引起的要小。涉及依次替换甘氨酸或在α螺旋和其他不允许的位置引入脯氨酸的不稳定替换也相当常见。最后,大多数暴露于溶剂中的残基显然可以自由替换而不会对蛋白质稳定性产生严重影响。尽管可能会有例外情况,但这些概括有助于总结大量信息,并且可以从物理和化学角度进行合理化解释。特别令人鼓舞的结果是,蛋白质似乎能够容忍大多数替换,即使是那些导致不稳定的替换,而天然结构不会有显著变化。对于结构已知的蛋白质,这意味着根据野生型结构来解释突变体表型是合理可行的。对于结构未知的蛋白质,可以合理推断那些降低活性但不影响稳定性的突变直接参与了功能。(摘要截选至400字)