Schmitt Armin O, Schuchhardt Johannes, Ludwig Arne, Brockmann Gudrun A
Institute for Animal Sciences, Humboldt-Universität zu Berlin, Invalidenstrasse 42, 10115 Berlin, Germany.
J Theor Biol. 2007 Nov 21;249(2):376-83. doi: 10.1016/j.jtbi.2007.08.001. Epub 2007 Aug 8.
Protein evolution can be seen as the successive replacement of amino acids by other amino acids. In general, it is a very slow process which is triggered by point mutations in the nucleotide sequence. These mutations can transform into single nucleotide polymorphisms (SNPs) within populations and diverging proteins between species. It is well known that in many cases amino acids can be replaced by others without impeding the functioning of the protein, even if these are of quite different physico-chemical character. In some cases, however, almost any replacement would result in a functionally deficient protein. Based upon comprehensive published SNP data and applying correlation analysis we quantified the two antagonist factors controlling the process of amino acid replacement and thus protein evolution: First, the degenerate structure of the genetic code which facilitates the exchange of certain amino acids and, second, the physico-chemical forces which limit the range of possible exchanges to maintain a functional protein. We found that the observed frequencies of amino acid exchanges within species are best explained by the genetic code and that the conservation of physico-chemical properties plays a subordinate role, but has nevertheless to be considered as a key factor. Between moderately diverged species genetic code and physico-chemical properties exert comparable influence on amino acid exchanges. We furthermore studied amino acid exchanges in more detail for six species (four mammals, one bird, and one insect) and found that the profiles are highly correlated across all examined species despite their large evolutionary divergence of up to 800 million years. The species specific exchange profiles are also correlated to the exchange profile observed between different species. The currently available huge body of SNP data allows to characterize the role of two major shaping forces of protein evolution more quantitatively than before.
蛋白质进化可以看作是氨基酸被其他氨基酸依次取代的过程。一般来说,这是一个非常缓慢的过程,由核苷酸序列中的点突变引发。这些突变可在种群内转变为单核苷酸多态性(SNP),并导致物种间蛋白质的分化。众所周知,在许多情况下,氨基酸可被其他氨基酸取代而不妨碍蛋白质的功能,即便这些氨基酸的物理化学性质差异很大。然而,在某些情况下,几乎任何取代都会导致蛋白质功能缺陷。基于已发表的综合SNP数据并应用相关性分析,我们量化了控制氨基酸取代过程进而蛋白质进化的两个拮抗因素:其一,遗传密码的简并结构促进了某些氨基酸的交换;其二,物理化学作用力限制了可能的交换范围以维持蛋白质的功能。我们发现,物种内观察到的氨基酸交换频率最好由遗传密码来解释,物理化学性质的保守性起次要作用,但仍应被视为关键因素。在中等分化的物种之间,遗传密码和物理化学性质对氨基酸交换的影响相当。我们还更详细地研究了六个物种(四种哺乳动物、一种鸟类和一种昆虫)的氨基酸交换情况,发现尽管它们在进化上有高达8亿年的巨大差异,但所有被研究物种的交换模式高度相关。物种特异性的交换模式也与不同物种间观察到的交换模式相关。目前可用的大量SNP数据使得比以往更定量地描述蛋白质进化的两种主要塑造力的作用成为可能。