School of Computer Engineering, Hubei University of Arts and Science, Xiangyang, Hubei, China.
Hubei Key Laboratory of Power System Design and Test for Electrical Vehicle, Hubei University of Arts and Science, Xiangyang, China.
NPJ Biofilms Microbiomes. 2024 Nov 24;10(1):135. doi: 10.1038/s41522-024-00612-7.
The rapid development of high-throughput sequencing techniques provides an unprecedented opportunity to generate biological insights into microbiome-related diseases. However, the relationships among microbes, metabolites and human microenvironment are extremely complex, making data analysis challenging. Here, we present NMFGOT, which is a versatile toolkit for the integrative analysis of microbiome and metabolome data from the same samples. NMFGOT is an unsupervised learning framework based on nonnegative matrix factorization with graph regularized optimal transport, where it utilizes the optimal transport plan to measure the probability distance between microbiome samples, which better dealt with the nonlinear high-order interactions among microbial taxa and metabolites. Moreover, it also includes a spatial regularization term to preserve the spatial consistency of samples in the embedding space across different data modalities. We implemented NMFGOT in several multi-omics microbiome datasets from multiple cohorts. The experimental results showed that NMFGOT consistently performed well compared with several recently published multi-omics integrating methods. Moreover, NMFGOT also facilitates downstream biological analysis, including pathway enrichment analysis and disease-specific metabolite-microbe association analysis. Using NMFGOT, we identified the significantly and stable metabolite-microbe associations in GC and ESRD diseases, which improves our understanding for the mechanisms of human complex diseases.
高通量测序技术的快速发展为研究微生物组相关疾病提供了前所未有的生物学见解。然而,微生物、代谢物和人类微环境之间的关系极其复杂,使得数据分析具有挑战性。在这里,我们提出了 NMFGOT,这是一个用于整合分析来自相同样本的微生物组和代谢组数据的多功能工具包。NMFGOT 是一种基于非负矩阵分解和图正则最优传输的无监督学习框架,它利用最优传输计划来测量微生物组样本之间的概率距离,从而更好地处理微生物类群和代谢物之间的非线性高阶相互作用。此外,它还包括一个空间正则化项,以保持不同数据模态之间样本在嵌入空间中的空间一致性。我们在来自多个队列的多个多组学微生物组数据集上实现了 NMFGOT。实验结果表明,NMFGOT 与最近发表的几种多组学整合方法相比,性能始终表现良好。此外,NMFGOT 还促进了下游的生物学分析,包括通路富集分析和特定疾病的代谢物-微生物关联分析。使用 NMFGOT,我们确定了 GC 和 ESRD 疾病中显著且稳定的代谢物-微生物关联,这提高了我们对人类复杂疾病机制的理解。
NPJ Biofilms Microbiomes. 2024-11-24
BMC Microbiol. 2021-8-28
Prog Mol Biol Transl Sci. 2020
Front Cell Infect Microbiol. 2021
J Mol Biol. 2022-8-15
Microbiol Spectr. 2024-11-5
Nat Protoc. 2024-5
Kidney360. 2023-8-1
NPJ Biofilms Microbiomes. 2022-10-15
Trends Microbiol. 2022-7
Cell Death Dis. 2021-11-24
Antibiotics (Basel). 2021-3-11