Alam Shafiul, Phan Huong Thi Thanh, Okazaki Mio, Takagi Masahiro, Kawahara Kozo, Tsukahara Toshifumi, Suzuki Hitoshi
School of Materials Science, Japan Advanced Institute of Science and Technology, Nomi, Ishikawa 923-1292, Japan.
BMC Res Notes. 2014 Dec 19;7:934. doi: 10.1186/1756-0500-7-934.
Generally, the results of high throughput analyses contain information about gene expressions, and about exon expressions. Approximately 90% of primary protein-coding transcripts undergo alternative splicing in mammals. However, changes induced by alternative exons have not been properly analyzed for their impact on important molecular networks or their biological events. Even when alternative exons are identified, they are usually subjected to bioinformatics analysis in the same way as the gene ignoring the possibility of functionality change because of the alteration of domain caused by alternative exon. Here, we reveal an effective computational approach to explore an important molecular network based on potential changes of functionality induced by alternative exons obtained from our comprehensive analysis of neuronal cell differentiation.
From our previously identified 262 differentially alternatively spliced exons during neuronal cell differentiations, we extracted 241 sets that changed the amino acid sequences between the alternatively spliced sequences. Conserved domain searches indicated that annotated domain(s) were changed in 128 sets. We obtained 49 genes whose terms overlapped between domain description and gene annotation. Thus, these 49 genes have alternatively differentially spliced in exons that affect their main functions. We performed pathway analysis using these 49 genes and identified the EGFR (epidermal growth factor receptor) and mTOR (mammalian target of rapamycin) signaling pathway as being involved frequently. Recent studies reported that the mTOR pathway is associated with neuronal cell differentiation, vindicating that our approach extracted an important molecular network successfully.
Effective informatics approaches for exons should be more complex than those for genes, because changes in alternative exons affect protein functions via alterations of amino acid sequences and functional domains. Our method extracted alterations of functional domains and identified key alternative splicing events. We identified the EGFR and mTOR signaling pathway as the most affected pathway. The mTOR pathway is important for neuronal differentiation, suggesting that this in silico extraction of alternative splicing networks is useful. This preliminary analysis indicated that automated analysis of the effects of alternative splicing would provide a rich source of biologically relevant information.
一般来说,高通量分析的结果包含基因表达和外显子表达的信息。在哺乳动物中,约90%的初级蛋白质编码转录本会发生可变剪接。然而,可变外显子所诱导的变化对重要分子网络或生物事件的影响尚未得到恰当分析。即使可变外显子被识别出来,它们通常也会像基因一样以相同方式进行生物信息学分析,而忽略了由于可变外显子导致结构域改变而引起功能变化的可能性。在此,我们揭示了一种有效的计算方法,基于从我们对神经元细胞分化的全面分析中获得的可变外显子所诱导的潜在功能变化,来探索一个重要的分子网络。
从我们之前在神经元细胞分化过程中鉴定出的262个差异可变剪接外显子中,我们提取了241组在可变剪接序列之间改变氨基酸序列的外显子。保守结构域搜索表明,128组外显子中注释的结构域发生了变化。我们获得了49个基因,其结构域描述和基因注释之间的术语存在重叠。因此,这49个基因在影响其主要功能的外显子中发生了差异可变剪接。我们使用这49个基因进行了通路分析,并确定表皮生长因子受体(EGFR)和雷帕霉素哺乳动物靶点(mTOR)信号通路频繁涉及其中。最近的研究报道mTOR通路与神经元细胞分化相关,这证明我们的方法成功提取了一个重要的分子网络。
针对外显子的有效信息学方法应该比针对基因的方法更复杂,因为可变外显子的变化通过氨基酸序列和功能结构域的改变来影响蛋白质功能。我们的方法提取了功能结构域的变化并识别出关键的可变剪接事件。我们确定EGFR和mTOR信号通路是受影响最大的通路。mTOR通路对神经元分化很重要,这表明这种对可变剪接网络的计算机提取是有用的。这项初步分析表明,对可变剪接效应的自动化分析将提供丰富的生物学相关信息来源。