School of Medicine, Tufts University, Boston, MA, United States of America.
School of Computer and Software Engineering, Huaiyin Institute of Technology, Huai'an, Jiangsu, China.
PLoS One. 2018 Apr 10;13(4):e0195410. doi: 10.1371/journal.pone.0195410. eCollection 2018.
Identifying essential proteins is very important for understanding the minimal requirements of cellular life and finding human disease genes as well as potential drug targets. Experimental methods for identifying essential proteins are often costly, time-consuming, and laborious. Many computational methods for such task have been proposed based on the topological properties of protein-protein interaction networks (PINs). However, most of these methods have limited prediction accuracy due to the noisy and incomplete natures of PINs and the fact that protein essentiality may relate to multiple biological factors. In this work, we proposed a new centrality measure, OGN, by integrating orthologous information, gene expressions, and PINs together. OGN determines a protein's essentiality by capturing its co-clustering and co-expression properties, as well as its conservation in the evolution process. The performance of OGN was tested on the species of Saccharomyces cerevisiae. Compared with several published centrality measures, OGN achieves higher prediction accuracy in both working alone and ensemble.
鉴定必需蛋白质对于理解细胞生命的最小要求以及发现人类疾病基因和潜在药物靶点非常重要。鉴定必需蛋白质的实验方法通常成本高、耗时且费力。已经提出了许多基于蛋白质-蛋白质相互作用网络(PINs)拓扑特性的此类任务的计算方法。然而,由于 PINs 的嘈杂和不完整的性质以及蛋白质必需性可能与多个生物学因素有关,这些方法中的大多数预测准确性有限。在这项工作中,我们通过将同源信息、基因表达和 PINs 整合在一起,提出了一种新的中心度度量 OGNOGN 通过捕获蛋白质的共聚类和共表达特性以及其在进化过程中的保守性来确定蛋白质的必需性。在酿酒酵母等物种上测试了 OGNOGN 的性能。与几个已发表的中心度度量相比,OGNOGN 在单独使用和集成使用时都具有更高的预测准确性。