Labedan B, Riley M
Institut de Génétique et Microbiologie, Université de Paris-Sud.
Mol Biol Evol. 1995 Nov;12(6):980-7. doi: 10.1093/oxfordjournals.molbev.a040291.
Sequences of 1,862 chromosomally encoded Escherichia coli K12 proteins were examined to identify genes likely to have arisen by duplication of genes in an ancestral chromosome. The criteria for sequence relatedness were an alignment of at least 100 amino acid residues and a PAM distance (number of accepted point mutations per 100 residues separating two sequences) below 250. A total of 971 of the 1,862 proteins examined were found in 2,329 sequence-related pairs that met these criteria. Most proteins of the sequence-related pairs were related in cellular function, as judged by biochemical and/or physiological features. Many of the pairs of proteins could be grouped into sequence-related families. If such groupings were generated from ancestral genes by duplication and divergence events, through these sequence comparisons we can identify putative ancestral sequences of the present-day genes of E. coli and other organisms. The results suggest that the 971 paralogous genes could have been derived from only 204 ancestral genes. We have also shown that the process of duplication and divergence is not the exclusive mechanism of evolution of all E. coli genes. Indeed, the relationships among the sequences of multiple (in the sense of redundant) enzymes indicate that nearly half could have arisen either by convergent evolution or by lateral transfer. Therefore, not all functionally related genes need arise by duplication and divergence.
对1862个染色体编码的大肠杆菌K12蛋白序列进行了检测,以确定可能由祖先染色体中的基因复制产生的基因。序列相关性的标准是至少100个氨基酸残基的比对以及低于250的PAM距离(分隔两个序列的每100个残基中接受的点突变数)。在接受检测的1862个蛋白中,共有971个存在于满足这些标准的2329个序列相关对中。根据生化和/或生理特征判断,序列相关对中的大多数蛋白在细胞功能上相关。许多蛋白对可以归为序列相关家族。如果这些分组是通过复制和分化事件从祖先基因产生的,那么通过这些序列比较,我们可以确定大肠杆菌和其他生物体当今基因的推定祖先序列。结果表明,这971个旁系同源基因可能仅来自204个祖先基因。我们还表明,复制和分化过程并非大肠杆菌所有基因进化的唯一机制。事实上,多种(从冗余意义上讲)酶的序列之间的关系表明,近一半可能是通过趋同进化或横向转移产生的。因此,并非所有功能相关的基因都需要通过复制和分化产生。