Kaminker Joshua S, Bergman Casey M, Kronmiller Brent, Carlson Joseph, Svirskas Robert, Patel Sandeep, Frise Erwin, Wheeler David A, Lewis Suzanna E, Rubin Gerald M, Ashburner Michael, Celniker Susan E
Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, UK.
Genome Biol. 2002;3(12):RESEARCH0084. doi: 10.1186/gb-2002-3-12-research0084. Epub 2002 Dec 23.
Transposable elements are found in the genomes of nearly all eukaryotes. The recent completion of the Release 3 euchromatic genomic sequence of Drosophila melanogaster by the Berkeley Drosophila Genome Project has provided precise sequence for the repetitive elements in the Drosophila euchromatin. We have used this genomic sequence to describe the euchromatic transposable elements in the sequenced strain of this species.
We identified 85 known and eight novel families of transposable element varying in copy number from one to 146. A total of 1,572 full and partial transposable elements were identified, comprising 3.86% of the sequence. More than two-thirds of the transposable elements are partial. The density of transposable elements increases an average of 4.7 times in the centromere-proximal regions of each of the major chromosome arms. We found that transposable elements are preferentially found outside genes; only 436 of 1,572 transposable elements are contained within the 61.4 Mb of sequence that is annotated as being transcribed. A large proportion of transposable elements is found nested within other elements of the same or different classes. Lastly, an analysis of structural variation from different families reveals distinct patterns of deletion for elements belonging to different classes.
This analysis represents an initial characterization of the transposable elements in the Release 3 euchromatic genomic sequence of D. melanogaster for which comparison to the transposable elements of other organisms can begin to be made. These data have been made available on the Berkeley Drosophila Genome Project website for future analyses.
转座元件存在于几乎所有真核生物的基因组中。伯克利果蝇基因组计划最近完成的黑腹果蝇3号版本常染色质基因组序列,为果蝇常染色质中的重复元件提供了精确序列。我们利用这一基因组序列来描述该物种测序品系中的常染色质转座元件。
我们鉴定出85个已知的和8个新的转座元件家族,其拷贝数从1到146不等。总共鉴定出1572个完整和部分转座元件,占序列的3.86%。超过三分之二的转座元件是部分的。在每个主要染色体臂的着丝粒近端区域,转座元件的密度平均增加4.7倍。我们发现转座元件优先存在于基因之外;在注释为转录的61.4 Mb序列中,1572个转座元件中只有436个包含在内。很大一部分转座元件嵌套在相同或不同类别的其他元件中。最后,对不同家族结构变异的分析揭示了不同类别的元件有不同的缺失模式。
该分析代表了对黑腹果蝇3号版本常染色质基因组序列中转座元件的初步特征描述,据此可以开始与其他生物的转座元件进行比较。这些数据已在伯克利果蝇基因组计划网站上公布,以供未来分析使用。