Suratanee Apichat, Plaimas Kitiporn
Department of Mathematics, Faculty of Applied Science, King Mongkut's University of Technology North Bangkok, Bangkok, Thailand.
Advanced Virtual and Intelligent Computing (AVIC) Center, Department of Mathematics and Computer Science, Faculty of Science, Chulalongkorn University, Bangkok, Thailand.
Bioinform Biol Insights. 2017 Jul 13;11:1177932217720405. doi: 10.1177/1177932217720405. eCollection 2017.
The associations between proteins and diseases are crucial information for investigating pathological mechanisms. However, the number of known and reliable protein-disease associations is quite small. In this study, an analysis framework to infer associations between proteins and diseases was developed based on a large data set of a human protein-protein interaction network integrating an effective network search, namely, the reverse -nearest neighbor (RNN) search. The RNN search was used to identify an impact of a protein on other proteins. Then, associations between proteins and diseases were inferred statistically. The method using the RNN search yielded a much higher precision than a random selection, standard nearest neighbor search, or when applying the method to a random protein-protein interaction network. All protein-disease pair candidates were verified by a literature search. Supporting evidence for 596 pairs was identified. In addition, cluster analysis of these candidates revealed 10 promising groups of diseases to be further investigated experimentally. This method can be used to identify novel associations to better understand complex relationships between proteins and diseases.
蛋白质与疾病之间的关联是研究病理机制的关键信息。然而,已知且可靠的蛋白质-疾病关联数量相当少。在本研究中,基于整合了有效网络搜索(即反向最近邻(RNN)搜索)的人类蛋白质-蛋白质相互作用网络的大数据集,开发了一种推断蛋白质与疾病之间关联的分析框架。RNN搜索用于识别一种蛋白质对其他蛋白质的影响。然后,通过统计学方法推断蛋白质与疾病之间的关联。使用RNN搜索的方法比随机选择、标准最近邻搜索或在随机蛋白质-蛋白质相互作用网络上应用该方法具有更高的精度。所有蛋白质-疾病对候选物均通过文献检索进行了验证。确定了596对的支持证据。此外,对这些候选物的聚类分析揭示了10个有前景的疾病组有待进一步进行实验研究。该方法可用于识别新的关联,以更好地理解蛋白质与疾病之间的复杂关系。