Aghamirzaie Delasa, Collakova Eva, Li Song, Grene Ruth
Genetics, Bioinformatics and Computational Biology, Virginia Tech, Blacksburg, VA, 24061, USA.
Department of Plant Pathology, Physiology, and Weed Science, Virginia Tech, Blacksburg, VA, 24061, USA.
BMC Genomics. 2016 Oct 28;17(1):845. doi: 10.1186/s12864-016-3172-6.
Alternative splicing has been proposed to increase transcript diversity and protein plasticity in eukaryotic organisms, but the extent to which this is the case is currently unclear, especially with regard to the diversification of molecular function. Eukaryotic splicing involves complex interactions of splicing factors and their targets. Inference of co-splicing networks capturing these types of interactions is important for understanding this crucial, highly regulated post-transcriptional process at the systems level.
First, several transcript and protein attributes, including coding potential of transcripts and differences in functional domains of proteins, were compared between splice variants and protein isoforms to assess transcript and protein diversity in a biological system. Alternative splicing was shown to increase transcript and function-related protein diversity in developing Arabidopsis embryos. Second, CoSpliceNet, which integrates co-expression and motif discovery at splicing regulatory regions to infer co-splicing networks, was developed. CoSpliceNet was applied to temporal RNA sequencing data to identify candidate regulators of splicing events and predict RNA-binding motifs, some of which are supported by prior experimental evidence. Analysis of inferred splicing factor targets revealed an unexpected role for the unfolded protein response in embryo development.
The methods presented here can be used in any biological system to assess transcript diversity and protein plasticity and to predict candidate regulators, their targets, and RNA-binding motifs for splicing factors. CoSpliceNet is freely available at http://delasa.github.io/co-spliceNet/ .
可变剪接被认为可增加真核生物中转录本的多样性和蛋白质的可塑性,但目前尚不清楚其实际程度,尤其是在分子功能多样化方面。真核生物的剪接涉及剪接因子及其靶标的复杂相互作用。推断捕获此类相互作用的共剪接网络对于从系统层面理解这一关键的、高度调控的转录后过程至关重要。
首先,比较了剪接变体和蛋白质异构体之间的几种转录本和蛋白质属性,包括转录本的编码潜力和蛋白质功能域的差异,以评估生物系统中的转录本和蛋白质多样性。结果表明,可变剪接增加了发育中的拟南芥胚胎中的转录本和功能相关蛋白质的多样性。其次,开发了CoSpliceNet,它整合了剪接调控区域的共表达和基序发现,以推断共剪接网络。将CoSpliceNet应用于时间RNA测序数据,以识别剪接事件的候选调节因子并预测RNA结合基序,其中一些得到了先前实验证据的支持。对推断的剪接因子靶标的分析揭示了未折叠蛋白反应在胚胎发育中的意外作用。
本文介绍的方法可用于任何生物系统,以评估转录本多样性和蛋白质可塑性,并预测剪接因子的候选调节因子、其靶标和RNA结合基序。CoSpliceNet可在http://delasa.github.io/co-spliceNet/免费获取。