Database Research Group, Control and Intelligent Processing Center of Excellence, School of Electrical and Computer Engineering, University College of Engineering, University of Tehran, Tehran, Iran.
Comput Biol Chem. 2013 Apr;43:11-6. doi: 10.1016/j.compbiolchem.2012.12.003. Epub 2012 Dec 20.
There is a large gap between the number of discovered proteins and the number of functionally annotated ones. Due to the high cost of determining protein function by wet-lab research, function prediction has become a major task for computational biology and bioinformatics. Some researches utilize the proteins interaction information to predict function for un-annotated proteins. In this paper, we propose a novel approach called "Neighbor Relativity Coefficient" (NRC) based on interaction network topology which estimates the functional similarity between two proteins. NRC is calculated for each pair of proteins based on their graph-based features including distance, common neighbors and the number of paths between them. In order to ascribe function to an un-annotated protein, NRC estimates a weight for each neighbor to transfer its annotation to the unknown protein. Finally, the unknown protein will be annotated by the top score transferred functions. We also investigate the effect of using different coefficients for various types of functions. The proposed method has been evaluated on Saccharomyces cerevisiae and Homo sapiens interaction networks. The performance analysis demonstrates that NRC yields better results in comparison with previous protein function prediction approaches that utilize interaction network.
已发现的蛋白质数量与功能注释数量之间存在很大差距。由于通过湿实验室研究确定蛋白质功能的成本很高,因此功能预测已成为计算生物学和生物信息学的主要任务。一些研究利用蛋白质相互作用信息来预测未注释蛋白质的功能。在本文中,我们提出了一种基于互作网络拓扑的新方法,称为“邻接相对系数”(NRC),用于估计两个蛋白质之间的功能相似性。根据距离、共同邻居和它们之间路径的数量等基于图的特征,为每对蛋白质计算 NRC。为了给未注释的蛋白质赋予功能,NRC 会为每个邻居估计一个权重,以将其注释转移到未知蛋白质上。最后,未知蛋白质将通过转移的最高分数来注释。我们还研究了为不同类型的功能使用不同系数的效果。该方法已在酿酒酵母和人类互作网络上进行了评估。性能分析表明,与利用互作网络的先前蛋白质功能预测方法相比,NRC 产生了更好的结果。