Ma Yuanyuan, Liu Guoying, Ma Yingjun, Chen Qianjun
School of Computer and Information Engineering, Anyang Normal University, Anyang, China.
School of Computer, Central China Normal University, Wuhan, China.
Front Genet. 2020 Feb 21;11:83. doi: 10.3389/fgene.2020.00083. eCollection 2020.
Microbe-disease association relationship mining is drawing more and more attention due to its potential in capturing disease-related microbes. Hence, it is essential to develop new tools or algorithms to study the complex pathogenic mechanism of microbe-related diseases. However, previous research studies mainly focused on the paradigm of "one disease, one microbe," rarely investigated the cooperation and associations between microbes, diseases or microbe-disease co-modules from system level. In this study, we propose a novel two-level module identifying algorithm (MDNMF) based on nonnegative matrix tri-factorization which integrates two similarity matrices (disease and microbe similarity matrices) and one microbe-disease association matrix into the objective of MDNMF. MDNMF can identify the modules from different levels and reveal the connections between these modules. In order to improve the efficiency and effectiveness of MDNMF, we also introduce human symptoms-disease network and microbial phylogenetic distance into this model. Furthermore, we applied it to HMDAD dataset and compared it with two NMF-based methods to demonstrate its effectiveness. The experimental results show that MDNMF can obtain better performance in terms of enrichment index (EI) and the number of significantly enriched taxon sets. This demonstrates the potential of MDNMF in capturing microbial modules that have significantly biological function implications.
微生物-疾病关联关系挖掘因其在捕获与疾病相关的微生物方面的潜力而受到越来越多的关注。因此,开发新的工具或算法来研究微生物相关疾病的复杂致病机制至关重要。然而,以往的研究主要集中在“一种疾病,一种微生物”的范式上,很少从系统层面研究微生物之间、疾病之间或微生物-疾病共模块之间的协同作用和关联。在本研究中,我们提出了一种基于非负矩阵三因子分解的新型两级模块识别算法(MDNMF),该算法将两个相似性矩阵(疾病和微生物相似性矩阵)和一个微生物-疾病关联矩阵整合到MDNMF的目标中。MDNMF可以从不同层面识别模块,并揭示这些模块之间的联系。为了提高MDNMF的效率和有效性,我们还将人类症状-疾病网络和微生物系统发育距离引入该模型。此外,我们将其应用于HMDAD数据集,并与两种基于非负矩阵分解的方法进行比较,以证明其有效性。实验结果表明,MDNMF在富集指数(EI)和显著富集分类单元集数量方面能够获得更好的性能。这证明了MDNMF在捕获具有显著生物学功能意义的微生物模块方面的潜力。