Scientific Research Center, Chengdu Medical College, Chengdu, 610500, China.
Clinical Laboratory Department, Zigong Hospital of Women's and Children's Healthcare, Zigong, 643002, China.
BMC Genomics. 2024 Jun 13;25(1):595. doi: 10.1186/s12864-024-10495-9.
Nuclear introns in Euglenida have been understudied. This study investigated them by identifying a large number of introns in Euglena gracilis (E. gracilis), including cis-spliced conventional and nonconventional introns as well as trans-spliced outrons, and by examining the sequence characteristics of these introns.
A total of 28,337 introns and 11,921 outrons were identified. Conventional and nonconventional introns have distinct splice site features: the former harbour canonical GT/C-AG splice sites, whereas the terminal sequences of the latter can form structured motifs. Short introns were preferentially of the canonical GT-AG type. Notably, conventional introns and outrons in E. gracilis exhibited a distinct cytidine-rich polypyrimidine tract, in contrast to the thymidine-rich tracts observed in other organisms. Furthermore, the SL-RNAs in E. gracilis, as well as in other trans-splicing species, can form a recently discovered motif, the extended U6/5' ss duplex, with their respective U6 snRNAs. We also describe a novel type of alternative splicing pattern in E. gracilis. The tandem repeat content of introns in this protist was determined and was comparable to that in humans.
Our findings highlight the unique features of E. gracilis introns and provide insights into the splicing mechanism of these introns, as well as the genomics and evolution of Euglenida.