Bowen Nathan J, Jordan I King, Epstein Jonathan A, Wood Valerie, Levin Henry L
Section on Eukaryotic Transposable Elements, Laboratory of Gene Regulation and Development, National Institute of Child Health and Human Development (NICHD), National Institutes of Health (NIH), Bethesda, Maryland 20892, USA.
Genome Res. 2003 Sep;13(9):1984-97. doi: 10.1101/gr.1191603.
The complete DNA sequence of the genome of Schizosaccharomyces pombe provides the opportunity to investigate the entire complement of transposable elements (TEs), their association with specific sequences, their chromosomal distribution, and their evolution. Using homology-based sequence identification, we found that the sequenced strain of S. pombe contained only one family of full-length transposons. This family, Tf2, consisted of 13 full-length copies of a long terminal repeat (LTR) retrotransposon. We found that LTR-LTR recombination of previously existing transposons had resulted in extensive populations of solo LTRs. These included 35 solo LTRs of Tf2, as well as 139 solo LTRs from other Tf families. Phylogenetic analysis of solo Tf LTRs reveals that Tf1 and Tf2 were the most recently active elements within the genome. The solo LTRs also served as footprints for previous insertion events by the Tf retrotransposons. Analysis of 186 genomic insertion events revealed a close association with RNA polymerase II promoters. These insertions clustered in the promoter-proximal regions of genes, upstream of protein coding regions by 100 to 400 nucleotides. The association of Tf insertions with pol II promoters was very similar to the preference previously observed for Tf1 integration. We found that the recently active Tf elements were absent from centromeres and pericentromeric regions of the genome containing tandem tRNA gene clusters. In addition, our analysis revealed that chromosome III has twice the density of insertion events compared to the other two chromosomes. Finally we describe a novel repetitive sequence, wtf, which was also preferentially located on chromosome III, and was often located near solo LTRs of Tf elements.
粟酒裂殖酵母基因组的完整DNA序列为研究转座元件(TEs)的完整互补情况、它们与特定序列的关联、在染色体上的分布及其进化提供了契机。通过基于同源性的序列鉴定,我们发现测序的粟酒裂殖酵母菌株仅包含一个全长转座子家族。这个家族,即Tf2,由13个长末端重复(LTR)逆转录转座子的全长拷贝组成。我们发现,先前存在的转座子的LTR-LTR重组产生了大量的单LTR群体。这些包括35个Tf2的单LTR,以及来自其他Tf家族的139个单LTR。对单Tf LTR的系统发育分析表明,Tf1和Tf2是基因组中最近活跃的元件。单LTR还作为Tf逆转录转座子先前插入事件的印记。对186个基因组插入事件的分析揭示了与RNA聚合酶II启动子的密切关联。这些插入集中在基因的启动子近端区域,位于蛋白质编码区域上游100至400个核苷酸处。Tf插入与pol II启动子的关联与先前观察到的Tf1整合偏好非常相似。我们发现,基因组中含有串联tRNA基因簇的着丝粒和着丝粒周围区域没有最近活跃的Tf元件。此外,我们的分析表明,与其他两条染色体相比,第三条染色体的插入事件密度是其两倍。最后,我们描述了一种新的重复序列wtf,它也优先位于第三条染色体上,并且经常位于Tf元件的单LTR附近。