Le Q H, Wright S, Yu Z, Bureau T
Department of Biology, McGill University, 1205 Docteur Penfield Avenue, Montreal, Quebec H3A 1B1, Canada.
Proc Natl Acad Sci U S A. 2000 Jun 20;97(13):7376-81. doi: 10.1073/pnas.97.13.7376.
The recent availability of extensive genome sequence information offers new opportunities to analyze genome organization, including transposon diversity and accumulation, at a previously unattainable level of resolution. In this report, we used sequence similarity search and analysis protocols to perform a fine-scale analysis of a large sample (approximately 17.2 Mb) of the Arabidopsis thaliana (Columbia) genome for transposons. Consistent with previous studies, we report that the A. thaliana genome harbors diverse representatives of most known superfamilies of transposons. However, our survey reveals a higher density of transposons, of which over one-fourth could be classified into a single novel transposon family designated Basho, which appears unrelated to any previously known superfamily. We have also identified putative transposase-coding ORFs for miniature inverted-repeat transposable elements (MITEs), providing clues to the mechanism of mobility and the origins of the most abundant transposons associated with plant genes. In addition, we provide evidence that most mined transposons have a clear distribution preference for A+T-rich sequences, and we show that structural variation for many mined transposons is partly due to interelement recombination. Taken together, these findings further underscore the complexity of transposons within the compact genome of A. thaliana.