EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, 08003 Barcelona, Spain.
Universitat Pompeu Fabra (UPF), 08003 Barcelona, Spain.
Genome Res. 2017 Oct;27(10):1759-1768. doi: 10.1101/gr.220962.117. Epub 2017 Aug 30.
Alternative splicing (AS) generates remarkable regulatory and proteomic complexity in metazoans. However, the functions of most AS events are not known, and programs of regulated splicing remain to be identified. To address these challenges, we describe the Vertebrate Alternative Splicing and Transcription Database (VastDB), the largest resource of genome-wide, quantitative profiles of AS events assembled to date. VastDB provides readily accessible quantitative information on the inclusion levels and functional associations of AS events detected in RNA-seq data from diverse vertebrate cell and tissue types, as well as developmental stages. The VastDB profiles reveal extensive new intergenic and intragenic regulatory relationships among different classes of AS and previously unknown and conserved landscapes of tissue-regulated exons. Contrary to recent reports concluding that nearly all human genes express a single major isoform, VastDB provides evidence that at least 48% of multiexonic protein-coding genes express multiple splice variants that are highly regulated in a cell/tissue-specific manner, and that >18% of genes simultaneously express multiple major isoforms across diverse cell and tissue types. Isoforms encoded by the latter set of genes are generally coexpressed in the same cells and are often engaged by translating ribosomes. Moreover, they are encoded by genes that are significantly enriched in functions associated with transcriptional control, implying they may have an important and wide-ranging role in controlling cellular activities. VastDB thus provides an unprecedented resource for investigations of AS function and regulation.
可变剪接 (AS) 在后生动物中产生了显著的调控和蛋白质组复杂性。然而,大多数 AS 事件的功能尚不清楚,调控剪接的程序仍有待确定。为了解决这些挑战,我们描述了脊椎动物可变剪接和转录数据库 (VastDB),这是迄今为止组装的最大的基因组范围的 AS 事件定量图谱资源。VastDB 提供了易于访问的定量信息,包括从不同的脊椎动物细胞和组织类型以及发育阶段的 RNA-seq 数据中检测到的 AS 事件的包含水平和功能关联。VastDB 图谱揭示了不同 AS 类别之间以及以前未知和保守的组织调节外显子之间广泛的新的基因间和基因内调控关系。与最近的报告结论相反,即几乎所有人类基因都表达单一的主要异构体,VastDB 提供的证据表明,至少 48%的多外显子蛋白编码基因表达多种高度受细胞/组织特异性调控的剪接变体,而且 >18%的基因在不同的细胞和组织类型中同时表达多个主要异构体。后一组基因编码的异构体通常在相同的细胞中共同表达,并且经常被翻译核糖体结合。此外,它们由基因编码,这些基因显著富集与转录控制相关的功能,这意味着它们可能在控制细胞活动方面具有重要和广泛的作用。因此,VastDB 为 AS 功能和调控的研究提供了一个前所未有的资源。