Belcaid Mahdi, Bergeron Anne, Poisson Guylaine
Information and Computer Sciences, University of Hawaii, Honolulu, HI, USA.
J Comput Biol. 2010 Sep;17(9):1315-26. doi: 10.1089/cmb.2010.0108.
Comparing the genomes of two closely related viruses often produces mosaics where nearly identical sequences alternate with sequences that are unique to each genome. When several closely related genomes are compared, the unique sequences are likely to be shared with third genomes, leading to virus mosaic communities. Here we present comparative analysis of sets of Staphylococcus aureus phages that share large identical sequences with up to three other genomes, and with different partners along their genomes. We introduce mosaic graphs to represent these complex recombination events, and use them to illustrate the breath and depth of sequence sharing: some genomes are almost completely made up of shared sequences, while genomes that share very large identical sequences can adopt alternate functional modules. Mosaic graphs also allow us to identify breakpoints that could eventually be used for the construction of recombination networks. These findings have several implications on phage metagenomics assembly, on the horizontal gene transfer paradigm, and more generally on the understanding of the composition and evolutionary dynamics of virus communities.
比较两种亲缘关系密切的病毒的基因组,常常会产生镶嵌现象,即几乎相同的序列与每个基因组特有的序列交替出现。当比较几个亲缘关系密切的基因组时,这些独特的序列很可能会与第三个基因组共享,从而形成病毒镶嵌群落。在这里,我们展示了对金黄色葡萄球菌噬菌体组的比较分析,这些噬菌体与其他多达三个基因组共享大量相同序列,并且在其基因组上与不同的伙伴共享。我们引入镶嵌图来表示这些复杂的重组事件,并用它们来说明序列共享的广度和深度:一些基因组几乎完全由共享序列组成,而共享非常大的相同序列的基因组可以采用交替的功能模块。镶嵌图还使我们能够识别最终可用于构建重组网络的断点。这些发现对噬菌体宏基因组组装、水平基因转移范式以及更广泛地对病毒群落的组成和进化动态的理解都有若干影响。