Hall S L, Padgett R A
Department of Biochemistry, University of Texas Southwestern Medical Center at Dallas 75235.
J Mol Biol. 1994 Jun 10;239(3):357-65. doi: 10.1006/jmbi.1994.1377.
Eukaryotic nuclear genomes contain a rare class of pre-mRNA introns with consensus sequence features that differ markedly from most pre-mRNA introns. Four genes have so far been identified that contain one copy each of this rare intron class in addition to several standard introns. These introns and homologous introns from several species were compared to identify conserved sequence elements and to establish consensus sequences for these elements. The only well-conserved elements are found at the 5' and 3' ends of the introns. The 5' splice site sequence is ATATCCTT beginning with the first nucleotide of the intron and is invariant in the introns examined to date. The 3' splice site consensus sequence is YCCAC ending at the last nucleotide of the intron. An almost invariant sequence of TCCTTAAC is also found near the 3' end of the intron (the 3' upstream element). The length of the introns varies between 95 and 2940 nucleotides. The sequence organization of these introns suggests that they represent a variant class of pre-mRNA introns that might be spliced via a spliceosome mechanism employing factors distinct from those used by other pre-mRNA introns. A search of small nuclear RNA (snRNA) sequences for regions complementary to the conserved elements of this rare class of introns found a strong match between U12 snRNA and the 3' upstream element and a weaker match between U11 snRNA and the 5' splice site sequence.
真核细胞核基因组包含一类罕见的前体mRNA内含子,其共有序列特征与大多数前体mRNA内含子明显不同。到目前为止,已鉴定出四个基因,每个基因除了含有几个标准内含子外,还各自含有一个这种罕见内含子。对这些内含子以及来自几个物种的同源内含子进行了比较,以确定保守序列元件并建立这些元件的共有序列。唯一保守性良好的元件位于内含子的5'端和3'端。5'剪接位点序列从内含子的第一个核苷酸开始为ATATCCTT,在迄今所研究的内含子中是不变的。3'剪接位点共有序列为YCCAC,位于内含子的最后一个核苷酸处。在靠近内含子3'端(3'上游元件)还发现了几乎不变的TCCTTAAC序列。内含子的长度在95至2940个核苷酸之间变化。这些内含子的序列组织表明,它们代表了一类前体mRNA内含子的变体,可能通过剪接体机制进行剪接,该机制使用的因子与其他前体mRNA内含子所使用的因子不同。在小核RNA(snRNA)序列中搜索与这类罕见内含子的保守元件互补的区域,发现U12 snRNA与3'上游元件之间有很强的匹配,而U11 snRNA与5'剪接位点序列之间的匹配较弱。