Burke W D, Malik H S, Jones J P, Eickbush T H
Department of Biology, University of Rochester, New York 14627-0211, USA.
Mol Biol Evol. 1999 Apr;16(4):502-11. doi: 10.1093/oxfordjournals.molbev.a026132.
R2 elements are non-LTR retrotransposons that insert in the 28S rRNA genes of arthropods. Partial sequence data from many species have previously suggested that these elements have been vertically inherited since the origin of this phylum. Here, we compare the complete sequences of nine R2 elements selected to represent the diversity of arthropods. All of the elements exhibited a uniform structure. Identification of their conserved sequence features, combined with our biochemical studies, allows us to make the following inferences concerning the retrotransposition mechanism of R2. While all R2 elements insert into the identical sequence of the 28S gene, it is only the location of the initial nick in the target DNA that is rigidly conserved across arthropods. Variation at the R2 5' junctions suggests that cleavage of the second strand of the target site is not conserved within or between species. The extreme 5' and 3' ends of the elements themselves are also poorly conserved, consistent with a target primed reverse transcription mechanism for attachment of the 3' end and a template switch model for the attachment of the 5' end. Comparison of the approximately 1,000-aa R2 ORF reveals that it can be divided into three domains. The central 450-aa domain can be folded by homology modeling into a tertiary structure resembling the fingers, palm, and thumb subdomains of retroviral reverse transcriptases. The carboxyl terminal end of the R2 protein appears to be the endonuclease domain, while the amino-terminal end contains zinc finger and c-myb-like DNA-binding motifs.
R2元件是非LTR逆转座子,可插入节肢动物的28S rRNA基因中。此前,来自许多物种的部分序列数据表明,自该门动物起源以来,这些元件一直是垂直遗传的。在这里,我们比较了九个R2元件的完整序列,这些元件被选来代表节肢动物的多样性。所有元件都呈现出统一的结构。对其保守序列特征的鉴定,结合我们的生化研究,使我们能够对R2的逆转座机制做出以下推断。虽然所有R2元件都插入到28S基因的相同序列中,但只有目标DNA中初始切口的位置在节肢动物中严格保守。R2 5'连接处的变异表明,目标位点第二条链的切割在物种内部或物种之间并不保守。元件本身的极端5'和3'末端也保守性较差,这与3'末端附着的目标引发逆转录机制以及5'末端附着的模板切换模型一致。对大约1000个氨基酸的R2开放阅读框的比较表明,它可以分为三个结构域。中间450个氨基酸的结构域可以通过同源建模折叠成类似于逆转录病毒逆转录酶的手指、手掌和拇指亚结构域的三级结构。R2蛋白的羧基末端似乎是核酸内切酶结构域,而氨基末端包含锌指和c-myb样DNA结合基序。