The Wellcome Trust Sanger Institute, Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
BMC Genomics. 2009 Dec 7;10:584. doi: 10.1186/1471-2164-10-584.
Rhs genes are prominent features of bacterial genomes that have previously been implicated in genomic rearrangements in E. coli. By comparing rhs repertoires across the Enterobacteriaceae, this study provides a robust explanation of rhs diversification and evolution, and a mechanistic model of how rhs diversity is gained and lost.
Rhs genes are ubiquitous and comprise six structurally distinct lineages within the Enterobacteriaceae. There is considerable intergenomic variation in rhs repertoire; for instance, in Salmonella enterica, rhs are restricted to mobile elements, while in Escherichia coli one rhs lineage has diversified through transposition as older lineages have been deleted. Overall, comparative genomics reveals frequent, independent gene gains and losses, as well as occasional lateral gene transfer, in different genera. Furthermore, we demonstrate that Rhs 'core' domains and variable C-termini are evolutionarily decoupled, and propose that rhs diversity is driven by homologous recombination with circular intermediates. Existing C-termini are displaced by laterally acquired alternatives, creating long arrays of dissociated 'tips' that characterize the appearance of rhs loci.
Rhs repertoires are highly dynamic among Enterobacterial genomes, due to repeated gene gains and losses. In contrast, the primary structures of Rhs genes are evolutionarily conserved, indicating that rhs sequence diversity is driven, not by rapid mutation, but by the relatively slow evolution of novel core/tip combinations. Hence, we predict that a large pool of dissociated rhs C-terminal tips exists episomally and these are potentially transmitted across taxonomic boundaries.
Rhs 基因是细菌基因组中的显著特征,先前曾被认为与大肠杆菌中的基因组重排有关。通过比较肠杆菌科中的 rhs 库,本研究为 rhs 的多样化和进化提供了一个强有力的解释,并提出了一个关于 rhs 多样性如何获得和丧失的机制模型。
Rhs 基因普遍存在,在肠杆菌科中包含六个结构上不同的谱系。rhs 库在基因组间存在相当大的变异;例如,在沙门氏菌中,rhs 仅限于移动元件,而在大肠杆菌中,一个 rhs 谱系通过转座而多样化,而较旧的谱系则被删除。总体而言,比较基因组学揭示了不同属中频繁的、独立的基因获得和丧失,以及偶尔的水平基因转移。此外,我们证明了 Rhs 的“核心”结构域和可变 C 末端在进化上是分离的,并提出 rhs 多样性是由与圆形中间产物的同源重组驱动的。现有的 C 末端被侧向获得的替代物取代,形成了长的分离“尖端”阵列,这是 rhs 基因座出现的特征。
由于基因的反复获得和丧失,rhs 库在肠杆菌基因组中高度动态。相比之下,Rhs 基因的一级结构在进化上是保守的,这表明 rhs 序列多样性不是由快速突变驱动的,而是由新的核心/尖端组合的相对缓慢进化驱动的。因此,我们预测存在大量分离的 rhs C 末端尖端以附加体的形式存在,并且这些尖端可能在分类学边界之间传播。