Luo Jiawei, Long Yahui
IEEE/ACM Trans Comput Biol Bioinform. 2020 Jul-Aug;17(4):1341-1351. doi: 10.1109/TCBB.2018.2883041. Epub 2018 Nov 23.
Accumulating clinic evidences have demonstrated that the microbes residing in human bodies play a significantly important role in the formation, development, and progression of various complex human diseases. Identifying latent related microbes for disease could provide insight into human disease mechanisms and promote disease prevention, diagnosis, and treatment. In this paper, we first construct a heterogeneous network by connecting the disease similarity network and the microbe similarity network through known microbe-disease association network, and then develop a novel computational model to predict human microbe-disease associations based on random walk by integrating network topological similarity (NTSHMDA). Specifically, each microbe-disease association pair is regarded as a distinct relationship level and, thus, assigned different weights based on network topological similarity. The experimental results show that NTSHMDA outperforms some state-of-the-art methods with average AUCs of 0.9070, 0.8896 ± 0.0038 in the frameworks of Leave-one-out cross validation and 5-fold cross validation, respectively. In case studies, 9, 18, 38 and 9, 18, 45 out of top-10, 20, 50 candidate microbes are verified by recently published literatures for asthma and inflammatory bowel disease, respectively. In conclusion, NTSHMDA has potential ability to identify novel disease-microbe associations and can also provide valuable information for drug discovery and biological researches.
越来越多的临床证据表明,存在于人体中的微生物在各种复杂人类疾病的形成、发展和进程中发挥着极其重要的作用。识别与疾病潜在相关的微生物能够深入了解人类疾病机制,并促进疾病的预防、诊断和治疗。在本文中,我们首先通过已知的微生物-疾病关联网络连接疾病相似性网络和微生物相似性网络构建一个异质网络,然后开发一种基于随机游走的新型计算模型,通过整合网络拓扑相似性来预测人类微生物-疾病关联(NTSHMDA)。具体而言,每个微生物-疾病关联对被视为一个独特的关系层级,因此基于网络拓扑相似性赋予不同的权重。实验结果表明,在留一法交叉验证和五折交叉验证框架下,NTSHMDA分别以平均AUC值0.9070、0.8896±0.0038优于一些现有最先进的方法。在案例研究中,哮喘和炎症性肠病的前10、20、50名候选微生物中,分别有9、18、38个和9、18、45个被最近发表的文献证实。总之,NTSHMDA具有识别新型疾病-微生物关联的潜在能力,并且还可为药物发现和生物学研究提供有价值的信息。