Noma K, Ohtsubo H, Ohtsubo E
Institute of Molecular and Cellular Biosciences, The University of Tokyo, Japan.
DNA Res. 2000 Oct 31;7(5):291-303. doi: 10.1093/dnares/7.5.291.
Non-LTR retrotransposons (LINEs) are ubiquitous elements in the plant kingdom. Two hundred and nineteen LINE homologues (named ATLN) were identified in the A. thaliana genome, about 90% of which have been sequenced by a computer-aided homology search. Of these, the structures of 62 were analyzed in detail. Most, including those truncated for the 5' regions, were flanked by direct repeats of a sequence of 7-21 bp long, the target site sequence duplicated upon retrotransposition of each member. Thirty ATLN members had two consecutive open reading frames, corresponding to orf1 and orf2 essential for retrotransposition. The phylogenetic tree constructed from the amino acid sequences of the endonuclease domains of the Orf2 proteins showed that the ATLN members were grouped in two families (I and II) and that the members of each family could be further divided into several subfamilies. The members of each subfamily had several unique structural features in common in the intergenic region between orf1 and orf2 as well as in the downstream regions of orf2. Interestingly, orf2 in almost all the ATLN members is located in the -1 frame relative to orf1, indicative of the existence of such translational control mechanisms as translational coupling or frameshifting to produce an amount of Orf2 protein appropriate to that of Orf1. Moreover, the most proximal sequences in the 5' untranslated regions were non-homologous, even in members with the highest homology, unlike the LINEs in animals. The non-homologous sequences in the 5' untranslated regions might be acquired at or after transcription during retrotransposition of the ATLN elements.
非长末端重复逆转座子(LINEs)是植物界中普遍存在的元件。在拟南芥基因组中鉴定出了219个LINE同源物(命名为ATLN),其中约90%已通过计算机辅助同源性搜索进行了测序。在这些同源物中,对62个的结构进行了详细分析。大多数,包括那些5'区域被截断的,两侧都有7 - 21 bp长的序列的直接重复,即每个成员逆转座时目标位点序列被复制。30个ATLN成员有两个连续的开放阅读框,对应于逆转座所必需的orf1和orf2。根据Orf2蛋白的内切酶结构域的氨基酸序列构建的系统发育树表明,ATLN成员被分为两个家族(I和II),并且每个家族的成员可以进一步分为几个亚家族。每个亚家族的成员在orf1和orf2之间的基因间隔区以及orf2的下游区域有几个共同的独特结构特征。有趣的是,几乎所有ATLN成员中的orf2相对于orf1位于 -1 读框中,这表明存在诸如翻译偶联或移码等翻译控制机制,以产生与Orf1量相适应的一定量的Orf2蛋白。此外,与动物中的LINEs不同,即使是同源性最高的成员,其5'非翻译区的最近端序列也不同源。5'非翻译区的非同源序列可能是在ATLN元件逆转座过程中在转录时或转录后获得的。