J. Craig Venter Institute, 4120 Capricorn Lane, La Jolla, CA 92037, USA.
Microb Genom. 2016 Jul 18;2(7):e000062. doi: 10.1099/mgen.0.000062. eCollection 2016 Jul.
Insertion sequence (IS) elements are important mediators of genome plasticity and can lead to phenotypic changes with evolutionary significance. In multidrug-resistant and , IS elements have contributed significantly to the mobilization of genes that encode resistance to antimicrobial drugs. A systematic analysis of IS elements is needed for a more comprehensive understanding of their evolutionary impact. We developed a computational approach (ISseeker) to annotate IS elements in draft genome assemblies and applied the method to analysis of IS elements in all publicly available (>1000) and (>800) genome sequences, in a phylogenetic context. Most IS elements in genomes are species-specific IS elements, whereas genomes contain significant numbers of both IS elements and elements that are found throughout the Enterobacteriaceae. genomes have a higher density of IS elements than averaging ~33 vs ~27 copies per genome. In , several insertion sites are shared by most genomes in the ST258 clade, whereas in , different IS elements are abundant in different phylogenetic groups, even among closely related Global Clone 2 strains. IS elements differ in the distribution of insertion locations relative to genes, with some more likely to disrupt genes and others predominantly in intergenic regions. Several genes and intergenic regions had multiple independent insertion events, suggesting that those events may confer a selective advantage. Genome- and taxon-wide characterization of insertion locations revealed that IS elements have been active contributors to genome diversity in both species.
插入序列 (IS) 元件是基因组可塑性的重要介质,可导致具有进化意义的表型变化。在多药耐药和中,IS 元件极大地促进了编码对抗生素药物耐药性的基因的转移。为了更全面地了解它们的进化影响,需要对 IS 元件进行系统分析。我们开发了一种计算方法(ISseeker)来注释草案基因组组装中的 IS 元件,并将该方法应用于所有公开可用的 (>1000) 和 (>800) 基因组序列(在系统发育背景下)中的 IS 元件分析。大多数基因组中的 IS 元件是种特异性 IS 元件,而基因组中既有 IS 元件,也有在肠杆菌科中普遍存在的元件。基因组中的 IS 元件密度高于,平均每个基因组约有 33 个拷贝,而只有 27 个拷贝。在 ST258 进化枝的大多数基因组中,有几个插入位点是共享的,而在 中,不同的 IS 元件在不同的系统发育群中丰富,即使是在密切相关的全球克隆 2 株中也是如此。IS 元件在插入位置相对于基因的分布上存在差异,有些更有可能破坏基因,而有些则主要在基因间区域。几个基因和基因间区域发生了多次独立的插入事件,这表明这些事件可能赋予了选择优势。对插入位置进行全基因组和分类群特征分析表明,IS 元件一直是这两个物种基因组多样性的活跃贡献者。