Sorek Rotem, Ast Gil, Graur Dan
Department of Zoology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Ramat Aviv 69978, Israel.
Genome Res. 2002 Jul;12(7):1060-7. doi: 10.1101/gr.229302.
Alu repetitive elements are found in approximately 1.4 million copies in the human genome, comprising more than one-tenth of it. Numerous studies describe exonizations of Alu elements, that is, splicing-mediated insertions of parts of Alu sequences into mature mRNAs. To study the connection between the exonization of Alu elements and alternative splicing, we used a database of ESTs and cDNAs aligned to the human genome. We compiled two exon sets, one of 1176 alternatively spliced internal exons, and another of 4151 constitutively spliced internal exons. Sixty one alternatively spliced internal exons (5.2%) had a significant BLAST hit to an Alu sequence, but none of the constitutively spliced internal exons had such a hit. The vast majority (84%) of the Alu-containing exons that appeared within the coding region of mRNAs caused a frame-shift or a premature termination codon. Alu-containing exons were included in transcripts at lower frequencies than alternatively spliced exons that do not contain an Alu sequence. These results indicate that internal exons that contain an Alu sequence are predominantly, if not exclusively, alternatively spliced. Presumably, evolutionary events that cause a constitutive insertion of an Alu sequence into an mRNA are deleterious and selected against.
在人类基因组中大约发现了140万个Alu重复元件,占基因组的十分之一以上。许多研究描述了Alu元件的外显子化,即通过剪接将Alu序列的部分插入到成熟mRNA中。为了研究Alu元件外显子化与可变剪接之间的联系,我们使用了与人类基因组比对的EST和cDNA数据库。我们编制了两个外显子集,一个包含1176个可变剪接的内部外显子,另一个包含4151个组成型剪接的内部外显子。61个可变剪接的内部外显子(5.2%)与Alu序列有显著的BLAST匹配,但组成型剪接的内部外显子均无此匹配。出现在mRNA编码区内的绝大多数含Alu外显子(84%)会导致移码或提前终止密码子。与不含Alu序列的可变剪接外显子相比,含Alu外显子在转录本中的包含频率较低。这些结果表明,含有Alu序列的内部外显子主要(如果不是唯一的话)是可变剪接的。据推测,导致Alu序列组成型插入到mRNA中的进化事件是有害的,并会被淘汰。