Skolkovo Institute for Science and Technology, Moscow, Russia.
The Institute for Information Transmission Problems RAS, Moscow, Russia.
PLoS Comput Biol. 2021 Apr 7;17(4):e1008329. doi: 10.1371/journal.pcbi.1008329. eCollection 2021 Apr.
Tandem alternative splice sites (TASS) is a special class of alternative splicing events that are characterized by a close tandem arrangement of splice sites. Most TASS lack functional characterization and are believed to arise from splicing noise. Based on the RNA-seq data from the Genotype Tissue Expression project, we present an extended catalogue of TASS in healthy human tissues and analyze their tissue-specific expression. The expression of TASS is usually dominated by one major splice site (maSS), while the expression of minor splice sites (miSS) is at least an order of magnitude lower. Among 46k miSS with sufficient read support, 9k (20%) are significantly expressed above the expected noise level, and among them 2.5k are expressed tissue-specifically. We found significant correlations between tissue-specific expression of RNA-binding proteins (RBP), tissue-specific expression of miSS, and miSS response to RBP inactivation by shRNA. In combination with RBP profiling by eCLIP, this allowed prediction of novel cases of tissue-specific splicing regulation including a miSS in QKI mRNA that is likely regulated by PTBP1. The analysis of human primary cell transcriptomes suggested that both tissue-specific and cell-type-specific factors contribute to the regulation of miSS expression. More than 20% of tissue-specific miSS affect structured protein regions and may adjust protein-protein interactions or modify the stability of the protein core. The significantly expressed miSS evolve under the same selection pressure as maSS, while other miSS lack signatures of evolutionary selection and conservation. Using mixture models, we estimated that not more than 15% of maSS and not more than 54% of tissue-specific miSS are noisy, while the proportion of noisy splice sites among non-significantly expressed miSS is above 63%.
串联替代 splicing 位点(TASS)是一种特殊的可变剪接事件,其特征是剪接位点的紧密串联排列。大多数 TASS 缺乏功能表征,被认为是由剪接噪声引起的。基于基因型组织表达项目的 RNA-seq 数据,我们在健康人体组织中呈现了一个扩展的 TASS 目录,并分析了它们的组织特异性表达。TASS 的表达通常由一个主要剪接位点(maSS)主导,而次要剪接位点(miSS)的表达至少低一个数量级。在具有足够读支持的 46k miSS 中,有 9k(20%)显著表达高于预期噪声水平,其中 2.5k 具有组织特异性表达。我们发现 RNA 结合蛋白(RBP)的组织特异性表达、miSS 的组织特异性表达和 miSS 对 shRNA 失活的反应之间存在显著相关性。与 eCLIP 进行的 RBP 分析相结合,这允许预测新的组织特异性剪接调节案例,包括 QKI mRNA 中的一个可能由 PTBP1 调节的 miSS。对人类原代细胞转录组的分析表明,组织特异性和细胞类型特异性因素都有助于调节 miSS 的表达。超过 20%的组织特异性 miSS 影响结构蛋白区域,并可能调整蛋白质-蛋白质相互作用或修饰蛋白质核心的稳定性。显著表达的 miSS 在与 maSS 相同的选择压力下进化,而其他 miSS 缺乏进化选择和保守的特征。使用混合模型,我们估计 maSS 中不超过 15%,组织特异性 miSS 中不超过 54%是噪声的,而在非显著表达的 miSS 中噪声剪接位点的比例高于 63%。