Shabalina Svetlana A, Ogurtsov Aleksey Y, Spiridonov Nikolay A, Koonin Eugene V
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20984, USA
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20984, USA.
Nucleic Acids Res. 2014 Jun;42(11):7132-44. doi: 10.1093/nar/gku342. Epub 2014 May 3.
Alternative splicing (AS), alternative transcription initiation (ATI) and alternative transcription termination (ATT) create the extraordinary complexity of transcriptomes and make key contributions to the structural and functional diversity of mammalian proteomes. Analysis of mammalian genomic and transcriptomic data shows that contrary to the traditional view, the joint contribution of ATI and ATT to the transcriptome and proteome diversity is quantitatively greater than the contribution of AS. Although the mean numbers of protein-coding constitutive and alternative nucleotides in gene loci are nearly identical, their distribution along the transcripts is highly non-uniform. On average, coding exons in the variable 5' and 3' transcript ends that are created by ATI and ATT contain approximately four times more alternative nucleotides than core protein-coding regions that diversify exclusively via AS. Short upstream exons that encompass alternative 5'-untranslated regions and N-termini of proteins evolve under strong nucleotide-level selection whereas in 3'-terminal exons that encode protein C-termini, protein-level selection is significantly stronger. The groups of genes that are subject to ATI and ATT show major differences in biological roles, expression and selection patterns.
可变剪接(AS)、可变转录起始(ATI)和可变转录终止(ATT)造就了转录组的非凡复杂性,并对哺乳动物蛋白质组的结构和功能多样性做出了关键贡献。对哺乳动物基因组和转录组数据的分析表明,与传统观点相反,ATI和ATT对转录组和蛋白质组多样性的联合贡献在数量上大于AS的贡献。尽管基因座中蛋白质编码组成型核苷酸和可变核苷酸的平均数量几乎相同,但它们沿转录本的分布极不均匀。平均而言,由ATI和ATT产生的可变5'和3'转录本末端的编码外显子所含可变核苷酸比仅通过AS多样化的核心蛋白质编码区域多约四倍。包含可变5'-非翻译区和蛋白质N端的短上游外显子在强烈的核苷酸水平选择下进化,而在编码蛋白质C端的3'-末端外显子中,蛋白质水平的选择明显更强。受ATI和ATT影响的基因群体在生物学作用、表达和选择模式上存在重大差异。