National Engineering Research Center of Tree Breeding and Ecological Restoration, College of Biological Sciences and Technology, Beijing Forestry University, Beijing 100083, China.
Plant Physiol. 2023 Sep 22;193(2):1281-1296. doi: 10.1093/plphys/kiad375.
Introns are noncoding sequences that the spliceosome removes from pre-mRNAs to produce mature mRNAs. The 5' ends of most introns begin with GU and carry a conserved sequence motif, AG/GUAAGU, that can base-pair with the core sequence of the U1 snRNA of the spliceosome. Intriguingly, ~1% of introns in various eukaryotic species begin with GC. Such introns can cause genes to be misannotated, yet the underlying splicing mechanism is unclear. We analyzed the sequences around the intron 5' splice site (ss) in Arabidopsis (Arabidopsis thaliana) and found that the sequences at the 5' ss of GC introns are much more stringent than those of GT introns (GU at the RNA level). Mutational analysis at various positions of the intron 5' ss revealed that, although mutations impair base pairing, different mutations at the same site can have different effects, suggesting that steric hindrance also affects splicing. Moreover, mutations of the 5' ss often activate a hidden ss nearby. Our data suggest that the 5' ss is selected via competition between the major ss and the nearby minor ss. This work not only provides insights into the splicing mechanism of the intron 5' ss but also helps improve the accuracy of gene annotation and informs the study of intron 5' ss evolution.
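As a rough illustration of why a GC intron may need a stricter surrounding sequence, the sketch below counts how many positions of a candidate 5' ss match the AG/GUAAGU motif quoted above, using that count as a crude proxy for potential base pairs with U1 snRNA. This is not the authors' analysis: the function names `consensus_matches` and `intron_class` are hypothetical, the 8-nt window (exon positions -2 and -1, intron positions +1 to +6) is an assumption taken from the motif as written, and the count ignores G-U wobble pairing and the steric effects the abstract describes.

```python
# Minimal sketch: compare a 5' splice site to the AG/GUAAGU motif from the text.
# Assumption: an 8-nt window, 2 exonic nt followed by the first 6 intronic nt.
CONSENSUS = "AGGUAAGU"


def _as_rna(site: str) -> str:
    """Upper-case the site and convert DNA-style T to RNA-style U."""
    return site.upper().replace("T", "U")


def consensus_matches(site: str) -> int:
    """Count positions identical to the consensus (a proxy for U1 base pairs)."""
    rna = _as_rna(site)
    if len(rna) != len(CONSENSUS):
        raise ValueError(f"expected {len(CONSENSUS)} nt, got {len(rna)}")
    return sum(a == b for a, b in zip(rna, CONSENSUS))


def intron_class(site: str) -> str:
    """Return the first two intronic nucleotides, e.g. 'GU' or 'GC'."""
    return _as_rna(site)[2:4]


if __name__ == "__main__":
    for site in ("AGGTAAGT", "AGGCAAGT", "TGGCAAGT"):
        print(site, intron_class(site), consensus_matches(site))
```

Under this simplification, a GC site can match the motif at no more than 7 of 8 positions, so any further mismatch leaves it with fewer matches than a GU site carrying the same mismatch. That is consistent with, though it does not demonstrate, the stricter sequence requirement reported for GC introns.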