Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK.
Mol Biol Evol. 2011 Oct;28(10):2949-59. doi: 10.1093/molbev/msr127. Epub 2011 May 6.
Alternative splicing (AS) has the potential to greatly expand the functional repertoire of mammalian transcriptomes. However, few variant transcripts have been characterized functionally, making it difficult to assess the contribution of AS to the generation of phenotypic complexity and to study the evolution of splicing patterns. We have compared the AS of 309 protein-coding genes in the human ENCODE pilot regions against their mouse orthologs in unprecedented detail, utilizing traditional transcriptomic and RNAseq data. The conservation status of every transcript has been investigated, and each functionally categorized as coding (separated into coding sequence [CDS] or nonsense-mediated decay [NMD] linked) or noncoding. In total, 36.7% of human and 19.3% of mouse coding transcripts are species specific, and we observe a 3.6 times excess of human NMD transcripts compared with mouse; in contrast to previous studies, the majority of species-specific AS is unlinked to transposable elements. We observe one conserved CDS variant and one conserved NMD variant per 2.3 and 11.4 genes, respectively. Subsequently, we identify and characterize equivalent AS patterns for 22.9% of these CDS or NMD-linked events in nonmammalian vertebrate genomes, and our data indicate that functional NMD-linked AS is more widespread and ancient than previously thought. Furthermore, although we observe an association between conserved AS and elevated sequence conservation, as previously reported, we emphasize that 30% of conserved AS exons display sequence conservation below the average score for constitutive exons. In conclusion, we demonstrate the value of detailed comparative annotation in generating a comprehensive set of AS transcripts, increasing our understanding of AS evolution in vertebrates. Our data supports a model whereby the acquisition of functional AS has occurred throughout vertebrate evolution and is considered alongside amino acid change as a key mechanism in gene evolution.
选择性剪接 (AS) 有可能极大地扩展哺乳动物转录组的功能谱。然而,很少有变体转录本具有功能特征,这使得难以评估 AS 对表型复杂性产生的贡献,并研究剪接模式的进化。我们以前所未有的细节比较了人类 ENCODE 试点区域中 309 个蛋白质编码基因的 AS 与其小鼠同源物,利用了传统的转录组学和 RNAseq 数据。每个转录本的保守状态都进行了研究,并根据其功能分为编码(分为编码序列 [CDS] 或无意义介导的衰变 [NMD] 相关)或非编码。总的来说,36.7%的人类和 19.3%的小鼠编码转录本是物种特异的,我们观察到人类 NMD 转录本的数量比小鼠多 3.6 倍;与之前的研究不同,大多数物种特异性 AS 与转座元件无关。我们观察到每 2.3 个和 11.4 个基因分别有一个保守的 CDS 变体和一个保守的 NMD 变体。随后,我们在非哺乳动物脊椎动物基因组中识别和表征了这些 CDS 或 NMD 相关事件中 22.9%的等效 AS 模式,我们的数据表明功能 NMD 相关 AS 的分布范围比以前认为的更广泛,而且更古老。此外,尽管我们观察到保守 AS 与序列保守性之间存在关联,如前所述,但我们强调,30%的保守 AS 外显子的序列保守性低于组成性外显子的平均得分。总之,我们证明了详细的比较注释在生成全面的 AS 转录本集方面的价值,从而增加了我们对脊椎动物 AS 进化的理解。我们的数据支持这样一种模型,即在脊椎动物进化过程中,功能 AS 的获得被认为是与氨基酸变化一样的基因进化的关键机制。