Sambaturu Narmada, Mishra Madhulika, Chandra Nagasuma
IISc Mathematics Initiative, Indian Institute of Science, Bangalore, 560012, India.
Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, India.
BMC Genomics. 2016 Aug 18;17 Suppl 4(Suppl 4):543. doi: 10.1186/s12864-016-2792-1.
In biological systems, diseases are caused by small perturbations in a complex network of interactions between proteins. Perturbations typically affect only a small number of proteins, which go on to disturb a larger part of the network. To counteract this, a stress-response is launched, resulting in a complex pattern of variations in the cell. Identifying the key players involved in either spreading the perturbation or responding to it can give us important insights.
We develop an algorithm, EpiTracer, which identifies the key proteins, or epicenters, from which a large number of changes in the protein-protein interaction (PPI) network ripple out. We propose a new centrality measure, ripple centrality, which measures how effectively a change at a particular node can ripple across the network by identifying highest activity paths specific to the condition of interest, obtained by mapping gene expression profiles to the PPI network. We demonstrate the algorithm using an overexpression study and a knockdown study. In the overexpression study, the gene that was overexpressed (PARK2) was highlighted as the most important epicenter specific to the perturbation. The other top-ranked epicenters were involved in either supporting the activity of PARK2, or counteracting it. Also, 5 of the identified epicenters showed no significant differential expression, showing that our method can find information which simple differential expression analysis cannot. In the second dataset (SP1 knockdown), alternative regulators of SP1 targets were highlighted as epicenters. Also, the gene that was knocked down (SP1) was picked up as an epicenter specific to the control condition. Sensitivity analysis showed that the genes identified as epicenters remain largely unaffected by small changes.
We develop an algorithm, EpiTracer, to find epicenters in condition-specific biological networks, given the PPI network and gene expression levels. EpiTracer includes programs which can extract the immediate influence zone of epicenters and provide a summary of dysregulated genes, facilitating quick biological analysis. We demonstrate its efficacy on two datasets with differing characteristics, highlighting its general applicability. We also show that EpiTracer is not sensitive to minor changes in the network. The source code for EpiTracer is provided at Github ( https://github.com/narmada26/EpiTracer ).
在生物系统中,疾病是由蛋白质间复杂相互作用网络中的微小扰动引起的。扰动通常只影响少数蛋白质,而这些蛋白质进而会扰乱网络的更大一部分。为了应对这种情况,会启动应激反应,导致细胞中出现复杂的变化模式。识别参与传播扰动或对其做出反应的关键因素可以为我们提供重要的见解。
我们开发了一种算法EpiTracer,它可以识别关键蛋白质或震中,大量蛋白质-蛋白质相互作用(PPI)网络的变化从这些关键蛋白质或震中向外扩散。我们提出了一种新的中心性度量方法——涟漪中心性,通过识别特定于感兴趣条件的最高活性路径(通过将基因表达谱映射到PPI网络获得)来衡量特定节点处的变化能够在网络中扩散的有效程度。我们使用过表达研究和敲低研究来演示该算法。在过表达研究中,过表达的基因(PARK2)被突出显示为特定于该扰动的最重要震中。其他排名靠前的震中要么参与支持PARK2的活性,要么对其进行抵消。此外,所识别的震中有5个没有显著的差异表达,这表明我们的方法可以找到简单差异表达分析无法获得的信息。在第二个数据集(SP1敲低)中,SP1靶标的替代调节因子被突出显示为震中。此外,被敲低的基因(SP1)被选为特定于对照条件的震中。敏感性分析表明,被识别为震中的基因在很大程度上不受微小变化的影响。
我们开发了一种算法EpiTracer,在给定PPI网络和基因表达水平的情况下,在特定条件的生物网络中找到震中。EpiTracer包括可以提取震中的直接影响区域并提供失调基因总结的程序,便于快速进行生物学分析。我们在两个具有不同特征的数据集上展示了它的功效,突出了其普遍适用性。我们还表明EpiTracer对网络中的微小变化不敏感。EpiTracer的源代码可在Github(https://github.com/narmada26/EpiTracer)上获取。