Pushker Ravindra, Mira Alex, Rodríguez-Valera Francisco
Evolutionary Genomics Group, Universidad Miguel Hernández, Campus de San Juan, Apartado 18, 03550 San Juan de Alicante, Alicante, Spain.
Genome Biol. 2004;5(4):R27. doi: 10.1186/gb-2004-5-4-r27. Epub 2004 Mar 18.
The wealth of genomic data in bacteria is helping microbiologists understand the factors involved in gene innovation. Among these, the expansion and reduction of gene families appear to have a fundamental role, but the factors influencing gene family size are unclear.
The relative content of paralogous genes in bacterial genomes increases with genome size, largely due to the expansion of gene family size in large genomes. Bacteria undergoing genome reduction display a parallel process of redundancy elimination, by which gene families are reduced to one or a few members. Gene family size is also influenced by sequence divergence and physiological function. Large gene families show wider sequence divergence, suggesting they are probably older, and certain functions (such as metabolite transport mechanisms) are overrepresented in large families. The size of a given gene family is remarkably similar in strains of the same species and in closely related species, suggesting that homologous gene families are vertically transmitted and depend little on horizontal gene transfer (HGT).
The remarkable preservation of copy numbers in widely different ecotypes indicates a functional role for the different copies rather than simply a back-up role. When different genera are compared, the increase in phylogenetic distance and/or ecological specialization disrupts this preservation, albeit in a gradual manner and maintaining an overall similarity, which also supports this view. HGT can have an important role, however, in nonhomologous gene families, as exemplified by a comparison between saprophytic and enterohemorrhagic strains of Escherichia coli.
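As an illustration of the quantities discussed in the results, the following is a minimal sketch of how the relative content of paralogous genes and the size of each gene family might be tallied for one genome. It assumes that every gene has already been assigned to a family; the abstract does not say how families were delimited, so the input mapping, the function names, and the toy genomes are all hypothetical.

```python
from collections import Counter


def family_sizes(gene_to_family):
    """Return the number of genes assigned to each family label."""
    return Counter(gene_to_family.values())


def paralog_fraction(gene_to_family):
    """Fraction of genes that sit in a family with two or more members."""
    if not gene_to_family:
        return 0.0
    sizes = family_sizes(gene_to_family)
    paralogous = sum(1 for fam in gene_to_family.values() if sizes[fam] >= 2)
    return paralogous / len(gene_to_family)


# Hypothetical toy genomes (gene id -> family label), not data from the study.
small_genome = {"g1": "famA", "g2": "famB", "g3": "famC", "g4": "famC"}
large_genome = {
    "g1": "famA", "g2": "famA", "g3": "famA",
    "g4": "famB",
    "g5": "famC", "g6": "famC", "g7": "famC", "g8": "famC",
}

for name, genome in [("small", small_genome), ("large", large_genome)]:
    sizes = family_sizes(genome)
    print(name, len(genome), dict(sizes), round(paralog_fraction(genome), 3))
```

Comparing the per-family counts of two strains or species in the same way would give a simple view of how well copy numbers are preserved between them.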