MSU-DOE Plant Research Laboratory, Michigan State University, East Lansing, Michigan 48824, USA.
J Bacteriol. 2010 Oct;192(20):5289-303. doi: 10.1128/JB.00460-10. Epub 2010 Jul 23.
Anabaena sp. strain PCC 7120, a widely studied cyanobacterium, has 145 annotated transposase genes that are part of transposable elements called insertion sequences (ISs). To determine the full extent of each IS, we aligned transposase genes and their flanking regions; identified the ISs' possible terminal inverted repeats, which are usually flanked by direct repeats; and compared IS-interrupted sequences with homologous sequences. We thereby determined both ends of 87 ISs bearing 110 transposase genes in eight IS families (http://www-is.biotoul.fr/) and in a cluster of unclassified ISs, as well as the ends of hitherto unknown miniature inverted-repeat transposable elements. We then identified open reading frames to which ISs contributed, and others that were interrupted by ISs; some of the interrupted frames encode proteins of predictable function, including protein kinases and restriction endonucleases. Anabaena sp. ISs were often more closely related to exogenous ISs than to other endogenous ISs, suggesting that numerous variant ISs were not degraded within PCC 7120 but were transferred from outside it. This observation suggests that further sequencing projects will extend this and similar analyses. We also propose an adaptive role for poly(A) sequences in ISs.
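As a rough illustration of the end-finding step described above, the following minimal Python sketch checks a candidate element for terminal inverted repeats and for direct repeats in its immediate flanks. It is not the authors' procedure, which relied on alignments of transposase genes and their flanking regions; the function names, the exact-match criterion, the `max_len` cutoff, and the toy sequences are all assumptions made for the example.

```python
# Toy sketch (hypothetical helper names; exact matching only, no mismatches or gaps).

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def tir_length(element: str) -> int:
    """Length of the perfect terminal inverted repeat of `element`.

    The left end is compared, base by base, with the reverse complement
    of the right end; counting stops at the first mismatch or at the
    midpoint of the element.
    """
    rc = reverse_complement(element)
    limit = len(element) // 2
    n = 0
    while n < limit and element[n] == rc[n]:
        n += 1
    return n


def flanking_direct_repeat(genome: str, start: int, end: int, max_len: int = 15) -> int:
    """Longest k (up to max_len) such that the k bases just before `start`
    equal the k bases just after `end` (0-based, half-open coordinates)."""
    best = 0
    for k in range(1, max_len + 1):
        if start - k < 0 or end + k > len(genome):
            break
        if genome[start - k:start] == genome[end:end + k]:
            best = k
    return best


# Invented example: a 19-bp "element" with 6-bp inverted-repeat ends,
# inserted between two copies of the 3-bp direct repeat ACG.
element = "GGCTTA" + "CCCAAAC" + reverse_complement("GGCTTA")
genome = "TTGA" + "ACG" + element + "ACG" + "CTTA"
start = genome.index(element)
end = start + len(element)

print(tir_length(element))                         # inverted-repeat length
print(flanking_direct_repeat(genome, start, end))  # direct-repeat length
```

Real IS ends are often imperfect repeats, so a practical version would need to tolerate mismatches and would treat any hit only as a candidate to be checked against homologous sequences.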