Bostani Razieh, Mirzaie Mehdi
Department of Applied Mathematics, Faculty of Mathematical Sciences, Tarbiat Modares University, Tehran, Iran.
Iran J Biotechnol. 2020 Jul 1;18(3):e2551. doi: 10.30498/IJB.2020.2551. eCollection 2020 Jul.
Recently, many researchers from different fields of science have been used networks to analyze complex relational big data. The identification of which nodes are more important than the others, known as centrality analysis, is a key issue in biological network analysis. Although, several centralities have been introduced degree, closeness, and betweenness centralities are the most popular. These centralities are based on the individual position of each node and the cooperation and synergies between nodes have been ignored.
Since in many cases, the network function is a consequence of cooperation and interaction between nodes, classical centralities were extended to a group of nodes instead of only individual nodes using cooperative game theory concepts. In this study, we analyze the protein interaction network inferred in rabies disease and rank gene products based on group centrality measurements to identify the novel gene candidates.
For this purpose, we used a game-theoretic approach at three scenarios, where the power of a coalition of genes assessed using different criteria including the neighbors of genes in the network, and predefined importance of the genes in its neighborhood. The Shapley value of such a game was considered as a new centrality. In this study, we analyze the network of gene products implicates rabies. The network has 1059 nodes and 8844 edges and centrality analysis was performed using CINNA package in R software.
Based on three scenarios, we selected genes among the highest Shapley value that had low ranking from classical centralities. The enrichment analysis among the selected genes in scenario 1 indicates important pathways in rabies pathogenesis. Pair-wise correlation analysis reveals that changing the weights of nodes at different scenarios can significantly affect the results of ranking genes in the network.
A prior knowledge about the disease and the topology of the network, enable us to design an appropriate game and consequently infer some biological important nodes (genes) in the network. Obviously, a single centrality cannot capture all significant features embedded in the network.
最近,来自不同科学领域的许多研究人员都在使用网络来分析复杂的关系型大数据。确定哪些节点比其他节点更重要,即中心性分析,是生物网络分析中的一个关键问题。尽管已经引入了几种中心性,但度中心性、接近中心性和中介中心性是最常用的。这些中心性是基于每个节点的个体位置,而节点之间的合作与协同作用被忽略了。
由于在许多情况下,网络功能是节点之间合作与相互作用的结果,因此利用合作博弈论的概念,将经典中心性扩展到一组节点而不是单个节点。在本研究中,我们分析了狂犬病中推断出的蛋白质相互作用网络,并基于组中心性测量对基因产物进行排序,以识别新的基因候选物。
为此,我们在三种情况下使用了博弈论方法,其中基因联盟的权力使用不同标准进行评估,包括网络中基因的邻居以及其邻域中基因的预定义重要性。这种博弈的沙普利值被视为一种新的中心性。在本研究中,我们分析了与狂犬病相关的基因产物网络。该网络有1059个节点和8844条边,使用R软件中的CINNA包进行中心性分析。
基于三种情况,我们从沙普利值最高但经典中心性排名较低的基因中进行选择。情景1中所选基因的富集分析表明了狂犬病发病机制中的重要途径。成对相关性分析表明,在不同情况下改变节点权重会显著影响网络中基因排序的结果。
关于疾病和网络拓扑的先验知识,使我们能够设计一个合适的博弈,从而推断出网络中一些具有生物学重要性的节点(基因)。显然,单一的中心性无法捕捉网络中嵌入的所有显著特征。