Luo Jiawei, Liang Shiyu
School of Information Science and Engineering, Hunan University, Changsha, China.
School of Information Science and Engineering, Hunan University, Changsha, China.
J Biomed Inform. 2015 Feb;53:229-36. doi: 10.1016/j.jbi.2014.11.004. Epub 2014 Nov 15.
Identifying candidate disease genes is important to improve medical care. However, this task is challenging in the post-genomic era. Several computational approaches have been proposed to prioritize potential candidate genes relying on protein-protein interaction (PPI) networks. However, the experimental PPI network is usually liable to contain a number of spurious interactions. In this paper, we construct a reliable heterogeneous network by fusing multiple networks, a PPI network reconstructed by topological similarity, a phenotype similarity network and known associations between diseases and genes. We then devise a random walk-based algorithm on the reliable heterogeneous network called RWRHN to prioritize potential candidate genes for inherited diseases. The results of leave-one-out cross-validation experiments show that the RWRHN algorithm has better performance than the RWRH and CIPHER methods in inferring disease genes. Furthermore, RWRHN is used to predict novel causal genes for 16 diseases, including breast cancer, diabetes mellitus type 2, and prostate cancer, as well as to detect disease-related protein complexes. The top predictions are supported by literature evidence.
识别候选疾病基因对于改善医疗保健至关重要。然而,在基因组时代之后,这项任务具有挑战性。已经提出了几种计算方法,依靠蛋白质-蛋白质相互作用(PPI)网络对潜在的候选基因进行优先级排序。然而,实验性PPI网络通常容易包含许多虚假相互作用。在本文中,我们通过融合多个网络构建了一个可靠的异质网络,一个通过拓扑相似性重建的PPI网络、一个表型相似性网络以及疾病与基因之间的已知关联。然后,我们在这个可靠的异质网络上设计了一种基于随机游走的算法,称为RWRHN,用于对遗传性疾病的潜在候选基因进行优先级排序。留一法交叉验证实验结果表明,RWRHN算法在推断疾病基因方面比RWRH和CIPHER方法具有更好的性能。此外,RWRHN被用于预测16种疾病的新致病基因,包括乳腺癌、2型糖尿病和前列腺癌,以及检测与疾病相关的蛋白质复合物。顶级预测结果得到了文献证据的支持。