Theoretical Biology and Bioinformatics, Department of Biology, Faculty of Science, Utrecht University, 3584 CH Utrecht, the Netherlands.
Laboratory of Microbiology, Wageningen University & Research, 6700 EH Wageningen, the Netherlands.
Mol Biol Evol. 2023 Jan 4;40(1). doi: 10.1093/molbev/msad011.
Eukaryotic genes are characterized by the presence of introns that are removed from pre-mRNA by a spliceosome. This ribonucleoprotein complex is comprised of multiple RNA molecules and over a hundred proteins, which makes it one of the most complex molecular machines that originated during the prokaryote-to-eukaryote transition. Previous works have established that these introns and the spliceosomal core originated from self-splicing introns in prokaryotes. Yet, how the spliceosomal core expanded by recruiting many additional proteins remains largely elusive. In this study, we use phylogenetic analyses to infer the evolutionary history of 145 proteins that we could trace back to the spliceosome in the last eukaryotic common ancestor. We found that an overabundance of proteins derived from ribosome-related processes was added to the prokaryote-derived core. Extensive duplications of these proteins substantially increased the complexity of the emerging spliceosome. By comparing the intron positions between spliceosomal paralogs, we infer that most spliceosomal complexity postdates the spread of introns through the proto-eukaryotic genome. The reconstruction of early spliceosomal evolution provides insight into the driving forces behind the emergence of complexes with many proteins during eukaryogenesis.
真核生物基因的特征是存在内含子,这些内含子通过剪接体从前体 mRNA 中被切除。这种核糖核蛋白复合物由多个 RNA 分子和一百多种蛋白质组成,使其成为原核生物到真核生物过渡期间产生的最复杂的分子机器之一。先前的研究已经证实,这些内含子和剪接体核心起源于原核生物中的自我剪接内含子。然而,剪接体核心如何通过招募许多额外的蛋白质来扩展仍然很大程度上难以捉摸。在这项研究中,我们使用系统发育分析来推断 145 种蛋白质的进化历史,这些蛋白质可以追溯到最后一个真核生物共同祖先中的剪接体。我们发现,大量源自核糖体相关过程的蛋白质被添加到源自原核生物的核心中。这些蛋白质的广泛重复大大增加了新兴剪接体的复杂性。通过比较剪接体同源物之间的内含子位置,我们推断大多数剪接体的复杂性发生在内含子通过原始真核生物基因组传播之后。早期剪接体进化的重建为真核生物发生过程中许多蛋白质复合物的出现提供了驱动力的深入了解。