Institut de Biologie de l'Ecole Normale Supérieure (IBENS), CNRS, Inserm, Ecole Normale Supérieure, PSL Research University, F-75005, Paris, France.
(Epi)genomics of Animal Development Unit, Department of Developmental and Stem Cell Biology, Institut Pasteur, 75015, Paris, France.
Genome Biol. 2017 Oct 30;18(1):208. doi: 10.1186/s13059-017-1344-6.
Most eukaryotic genes are subject to alternative splicing (AS), which may contribute to the production of protein variants or to the regulation of gene expression via nonsense-mediated messenger RNA (mRNA) decay (NMD). However, a fraction of splice variants might correspond to spurious transcripts and the question of the relative proportion of splicing errors to functional splice variants remains highly debated.
We propose a test to quantify the fraction of AS events corresponding to errors. This test is based on the fact that the fitness cost of splicing errors increases with the number of introns in a gene and with expression level. We analyzed the transcriptome of the intron-rich eukaryote Paramecium tetraurelia. We show that in both normal and in NMD-deficient cells, AS rates strongly decrease with increasing expression level and with increasing number of introns. This relationship is observed for AS events that are detectable by NMD as well as for those that are not, which invalidates the hypothesis of a link with the regulation of gene expression. Our results show that in genes with a median expression level, 92-98% of observed splice variants correspond to errors. We observed the same patterns in human transcriptomes and we further show that AS rates correlate with the fitness cost of splicing errors.
These observations indicate that genes under weaker selective pressure accumulate more maladaptive substitutions and are more prone to splicing errors. Thus, to a large extent, patterns of gene expression variants simply reflect the balance between selection, mutation, and drift.
大多数真核基因都受到选择性剪接(AS)的调控,这可能有助于产生蛋白质变体,或通过无意义介导的信使 RNA(mRNA)降解(NMD)来调节基因表达。然而,一部分剪接变体可能对应于假转录本,并且关于剪接错误与功能剪接变体的相对比例的问题仍然存在很大争议。
我们提出了一种定量检测对应于错误的 AS 事件的分数的方法。该测试基于以下事实:剪接错误的适应度成本随基因中的内含子数量和表达水平的增加而增加。我们分析了富含内含子的真核生物 Paramecium tetraurelia 的转录组。我们表明,在正常细胞和 NMD 缺陷细胞中,AS 率随着表达水平和内含子数量的增加而强烈降低。这种关系在可被 NMD 检测到的 AS 事件以及不可被 NMD 检测到的 AS 事件中都存在,这否定了与基因表达调控相关的假说。我们的结果表明,在具有中位数表达水平的基因中,观察到的剪接变体中有 92-98%对应于错误。我们在人类转录组中观察到了相同的模式,并且进一步表明 AS 率与剪接错误的适应度成本相关。
这些观察结果表明,受到较弱选择压力的基因积累了更多的适应性较差的替换,更容易发生剪接错误。因此,在很大程度上,基因表达变体的模式只是反映了选择、突变和漂移之间的平衡。