Nozawa Masafumi, Kumagai Masahiko, Aotsuka Tadashi, Tamura Koichiro
Department of Biological Sciences, Graduate School of Science, Tokyo Metropolitan University, Tokyo, Japan.
Mol Biol Evol. 2006 May;23(5):981-7. doi: 10.1093/molbev/msj105. Epub 2006 Feb 8.
New repeat sequences were found in the Drosophila ananassae genome sequence. They accounted for approximately 1.2% of the D. ananassae genome and were estimated to be more abundant in genomes of its closely related species belonging to the Drosophila bipectinata complex, whereas it was entirely absent in the Drosophila melanogaster genome. They were interspersed throughout euchromatic regions of the genome, usually as short tandem arrays of unit sequences, which were mostly 175-200 bp long with two distinct peaks at 180 and 189 bp in the length distribution. The nucleotide differences among unit sequences within the same array (locus) were much smaller than those between separate loci, suggesting within-locus concerted evolution. The phylogenetic tree of the repeat sequences from different loci showed that divergences between sequences from different chromosome arms occurred only at earlier stages of evolution, while those within the same chromosome arm occurred thereafter, resulting in the increase in copy number. We found RNA polymerase III promoter sequences (A box and B box), which play a critical role in retroposition of short interspersed elements. We also found conserved stem-loop structures, which are possibly associated with certain DNA rearrangements responsible for the increase in copy number within a chromosome arm. Such an atypical combination of characteristics (i.e., wide dispersal and tandem repetition) may have been generated by these different transposition mechanisms during the course of evolution.
在果蝇(Drosophila ananassae)基因组序列中发现了新的重复序列。它们约占果蝇基因组的1.2%,据估计在属于果蝇双栉角果蝇复合体(Drosophila bipectinata complex)的其近缘物种基因组中更为丰富,而在黑腹果蝇(Drosophila melanogaster)基因组中则完全不存在。它们散布在基因组的常染色质区域,通常呈短串联排列的单位序列,大多长度为175 - 200 bp,长度分布在180和189 bp处有两个明显的峰值。同一阵列(位点)内单位序列之间的核苷酸差异远小于不同位点之间的差异,表明在位点内存在协同进化。来自不同位点的重复序列的系统发育树表明,不同染色体臂的序列之间的分歧仅发生在进化的早期阶段,而同一染色体臂内的分歧则在此后发生,导致拷贝数增加。我们发现了RNA聚合酶III启动子序列(A框和B框),它们在短散在元件的逆转座中起关键作用。我们还发现了保守的茎环结构,它们可能与某些负责染色体臂内拷贝数增加的DNA重排有关。这种非典型的特征组合(即广泛散布和串联重复)可能是在进化过程中由这些不同的转座机制产生的。