Schichman S A, Severynse D M, Edgell M H, Hutchison C A
Department of Microbiology and Immunology, University of North Carolina, Chapel Hill 27599.
J Mol Biol. 1992 Apr 5;224(3):559-74. doi: 10.1016/0022-2836(92)90544-t.
LINE-1 (L1) is a mammalian family of highly repeated DNA sequences that are members of a class of transposable elements whose movement involves an RNA intermediate. Both structural and evolutionary data indicate that the L1 family consists of a small number of active transposable elements interspersed with a large number of L1 pseudogenes. In the mouse, the longest characterized L1 sequences span about 7000 base-pairs and contain two long open reading frames. Two subfamilies of mouse L1 elements, A and F, have been defined on the basis of the type of putative transcriptional regulatory sequence found at the 5' end. In order to identify a transcribed subset of L1 elements in mouse F9 teratocarcinoma cells, we have examined the strand-specificity of L1 transcription by Northern analysis and compared the open reading frame-1 sequences of ten A-type cDNAs with those of fifteen genomic A-type L1 elements. Transcripts containing A-type sequence are far more abundant than those containing F-type sequence. Although the majority of L1 RNA in F9 cells appears to be transcribed non-specifically from both strands, our results provide evidence for a subpopulation of variable-length, strand-specific transcripts arising from A-type transcriptional regulatory sequences. F9 cell cDNA sequences, which share greater than 99.5% sequence identity with one another, represent a homogeneous subset of the genomic L1 population. Examination of genomic mouse L1 sequences reveals three types of length polymorphism in a defined segment of the first open reading frame. Phylogenetic analysis shows a correlation between the type of length polymorphism in the first open reading frame and the relative age of an individual A-type genomic L1 element. Comparison of the cDNA and genomic sequences indicates that the youngest subgroup of A-type L1 elements is preferentially transcribed in F9 cells. This subgroup may be currently dominating the L1 dispersal process in mice.
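The abstract reports that the F9 cDNA sequences share more than 99.5% identity with one another. As a rough illustration only, pairwise identity between two pre-aligned sequences could be computed along the following lines. The function name `percent_identity`, the gap convention (columns with a gap in either sequence are skipped), and the toy sequences are all assumptions made for this sketch; they are not the authors' method or data.

```python
def percent_identity(seq_a: str, seq_b: str, gap: str = "-") -> float:
    """Percent identity between two pre-aligned sequences of equal length.

    Assumed convention: columns where either sequence has a gap are skipped,
    so identity is matches / ungapped columns.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == gap or b == gap:
            continue  # skip gapped columns (assumption of this sketch)
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy aligned sequences, invented for illustration (not from the paper).
toy_a = "ATGGCGAAAG-TAAC"
toy_b = "ATGGCGAAAGGTAAT"
print(f"{percent_identity(toy_a, toy_b):.1f}%")
```

Other conventions exist, for example counting gapped columns as mismatches, and the choice changes the reported percentage, so any comparison against a threshold such as 99.5% depends on which convention is used.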