Meng Zixuan, Kuang Linai, Chen Zhiping, Zhang Zhen, Tan Yihong, Li Xueyong, Wang Lei
College of Computer, Xiangtan University, Xiangtan, China.
College of Computer Engineering & Applied Mathematics, Changsha University, Changsha, China.
Front Genet. 2021 Mar 17;12:645932. doi: 10.3389/fgene.2021.645932. eCollection 2021.
In recent years a number of calculative models based on protein-protein interaction (PPI) networks have been proposed successively. However, due to false positives, false negatives, and the incompleteness of PPI networks, there are still many challenges affecting the design of computational models with satisfactory predictive accuracy when inferring key proteins. This study proposes a prediction model called WPDINM for detecting key proteins based on a novel weighted protein-domain interaction (PDI) network. In WPDINM, a weighted PPI network is constructed first by combining the gene expression data of proteins with topological information extracted from the original PPI network. Simultaneously, a weighted domain-domain interaction (DDI) network is constructed based on the original PDI network. Next, through integrating the newly obtained weighted PPI network and weighted DDI network with the original PDI network, a weighted PDI network is further constructed. Then, based on topological features and biological information, including the subcellular localization and orthologous information of proteins, a novel PageRank-based iterative algorithm is designed and implemented on the newly constructed weighted PDI network to estimate the criticality of proteins. Finally, to assess the prediction performance of WPDINM, we compared it with 12 kinds of competitive measures. Experimental results show that WPDINM can achieve a predictive accuracy rate of 90.19, 81.96, 70.72, 62.04, 55.83, and 51.13% in the top 1%, top 5%, top 10%, top 15%, top 20%, and top 25% separately, which exceeds the prediction accuracy achieved by traditional state-of-the-art competing measures. Owing to the satisfactory identification effect, the WPDINM measure may contribute to the further development of key protein identification.
近年来,一系列基于蛋白质-蛋白质相互作用(PPI)网络的计算模型相继被提出。然而,由于存在假阳性、假阴性以及PPI网络的不完整性,在推断关键蛋白质时,影响具有令人满意预测准确性的计算模型设计的挑战仍然众多。本研究基于一种新型加权蛋白质结构域相互作用(PDI)网络,提出了一种名为WPDINM的关键蛋白质检测预测模型。在WPDINM中,首先通过将蛋白质的基因表达数据与从原始PPI网络中提取的拓扑信息相结合,构建一个加权PPI网络。同时,基于原始PDI网络构建一个加权结构域-结构域相互作用(DDI)网络。接下来,通过将新获得的加权PPI网络和加权DDI网络与原始PDI网络进行整合,进一步构建一个加权PDI网络。然后,基于拓扑特征和生物学信息,包括蛋白质的亚细胞定位和直系同源信息,在新构建的加权PDI网络上设计并实现一种基于PageRank的新型迭代算法,以评估蛋白质的关键性。最后,为评估WPDINM的预测性能,我们将其与12种竞争方法进行了比较。实验结果表明,WPDINM在前1%、前5%、前10%、前15%、前20%和前25%中分别能达到90.19%、81.96%、70.72%、62.04%、55.83%和51.13%的预测准确率,超过了传统最先进竞争方法所达到的预测准确率。由于具有令人满意的识别效果,WPDINM方法可能有助于关键蛋白质识别的进一步发展。