• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于微生物组的多模态变分信息瓶颈疾病预测。

Microbiome-based disease prediction with multimodal variational information bottlenecks.

机构信息

NEC Laboratories Europe, Heidelberg, Germany.

Department of Electrical Engineering and Computer Science, Khalifa University, Abu Dhabi, UAE.

出版信息

PLoS Comput Biol. 2022 Apr 11;18(4):e1010050. doi: 10.1371/journal.pcbi.1010050. eCollection 2022 Apr.

DOI:10.1371/journal.pcbi.1010050
PMID:35404958
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9022840/
Abstract

Scientific research is shedding light on the interaction of the gut microbiome with the human host and on its role in human health. Existing machine learning methods have shown great potential in discriminating healthy from diseased microbiome states. Most of them leverage shotgun metagenomic sequencing to extract gut microbial species-relative abundances or strain-level markers. Each of these gut microbial profiling modalities showed diagnostic potential when tested separately; however, no existing approach combines them in a single predictive framework. Here, we propose the Multimodal Variational Information Bottleneck (MVIB), a novel deep learning model capable of learning a joint representation of multiple heterogeneous data modalities. MVIB achieves competitive classification performance while being faster than existing methods. Additionally, MVIB offers interpretable results. Our model adopts an information theoretic interpretation of deep neural networks and computes a joint stochastic encoding of different input data modalities. We use MVIB to predict whether human hosts are affected by a certain disease by jointly analysing gut microbial species-relative abundances and strain-level markers. MVIB is evaluated on human gut metagenomic samples from 11 publicly available disease cohorts covering 6 different diseases. We achieve high performance (0.80 < ROC AUC < 0.95) on 5 cohorts and at least medium performance on the remaining ones. We adopt a saliency technique to interpret the output of MVIB and identify the most relevant microbial species and strain-level markers to the model's predictions. We also perform cross-study generalisation experiments, where we train and test MVIB on different cohorts of the same disease, and overall we achieve comparable results to the baseline approach, i.e. the Random Forest. Further, we evaluate our model by adding metabolomic data derived from mass spectrometry as a third input modality. Our method is scalable with respect to input data modalities and has an average training time of < 1.4 seconds. The source code and the datasets used in this work are publicly available.

摘要

科学研究揭示了肠道微生物组与人类宿主的相互作用及其在人类健康中的作用。现有的机器学习方法在区分健康和患病的微生物组状态方面显示出巨大的潜力。它们大多利用 shotgun 宏基因组测序来提取肠道微生物物种相对丰度或菌株水平标记物。当单独测试时,这些肠道微生物特征分析模式中的每一种都显示出诊断潜力;然而,没有现有的方法将它们组合在一个单一的预测框架中。在这里,我们提出了多模态变分信息瓶颈(MVIB),这是一种新的深度学习模型,能够学习多个异构数据模态的联合表示。MVIB 实现了有竞争力的分类性能,同时比现有方法更快。此外,MVIB 提供了可解释的结果。我们的模型采用了对深度神经网络的信息论解释,并计算了不同输入数据模态的联合随机编码。我们使用 MVIB 通过联合分析肠道微生物物种相对丰度和菌株水平标记物来预测人类宿主是否受到某种疾病的影响。MVIB 基于 11 个公开的疾病队列中的人类肠道宏基因组样本进行评估,涵盖了 6 种不同的疾病。我们在 5 个队列中取得了很高的性能(0.80 < ROC AUC < 0.95),在其余的队列中至少取得了中等的性能。我们采用一种显著技术来解释 MVIB 的输出,并确定与模型预测最相关的微生物物种和菌株水平标记物。我们还进行了跨研究的泛化实验,即在同一疾病的不同队列中训练和测试 MVIB,总的来说,我们的结果与基线方法(即随机森林)相当。此外,我们通过添加基于质谱的代谢组学数据作为第三个输入模态来评估我们的模型。我们的方法在输入数据模态方面具有可扩展性,平均训练时间<1.4 秒。这项工作中使用的代码和数据集都是公开的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7f/9022840/5973a3f0a077/pcbi.1010050.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7f/9022840/2da03a508de5/pcbi.1010050.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7f/9022840/50a0cb08d292/pcbi.1010050.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7f/9022840/30be086305b0/pcbi.1010050.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7f/9022840/1db6da14174e/pcbi.1010050.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7f/9022840/58b4bd21e8f4/pcbi.1010050.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7f/9022840/b5f7ae414f96/pcbi.1010050.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7f/9022840/5973a3f0a077/pcbi.1010050.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7f/9022840/2da03a508de5/pcbi.1010050.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7f/9022840/50a0cb08d292/pcbi.1010050.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7f/9022840/30be086305b0/pcbi.1010050.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7f/9022840/1db6da14174e/pcbi.1010050.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7f/9022840/58b4bd21e8f4/pcbi.1010050.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7f/9022840/b5f7ae414f96/pcbi.1010050.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d7f/9022840/5973a3f0a077/pcbi.1010050.g007.jpg

相似文献

1
Microbiome-based disease prediction with multimodal variational information bottlenecks.基于微生物组的多模态变分信息瓶颈疾病预测。
PLoS Comput Biol. 2022 Apr 11;18(4):e1010050. doi: 10.1371/journal.pcbi.1010050. eCollection 2022 Apr.
2
Machine Learning Meta-analysis of Large Metagenomic Datasets: Tools and Biological Insights.大型宏基因组数据集的机器学习荟萃分析:工具与生物学见解
PLoS Comput Biol. 2016 Jul 11;12(7):e1004977. doi: 10.1371/journal.pcbi.1004977. eCollection 2016 Jul.
3
Gene-based microbiome representation enhances host phenotype classification.基于基因的微生物组表示增强了宿主表型分类。
mSystems. 2023 Aug 31;8(4):e0053123. doi: 10.1128/msystems.00531-23. Epub 2023 Jul 5.
4
Automatic disease prediction from human gut metagenomic data using boosting GraphSAGE.基于提升图抽样的人类肠道宏基因组数据自动疾病预测。
BMC Bioinformatics. 2023 Mar 31;24(1):126. doi: 10.1186/s12859-023-05251-x.
5
Human reference gut microbiome catalog including newly assembled genomes from under-represented Asian metagenomes.人类参考肠道微生物组目录,包括来自代表性不足的亚洲宏基因组的新组装基因组。
Genome Med. 2021 Aug 27;13(1):134. doi: 10.1186/s13073-021-00950-7.
6
A permutable MLP-like architecture for disease prediction from gut metagenomic data.一种可置换的类似于多层感知机的架构,用于从肠道宏基因组数据中进行疾病预测。
BMC Bioinformatics. 2024 Jul 24;25(1):246. doi: 10.1186/s12859-024-05856-w.
7
Systematic evaluation of supervised machine learning for sample origin prediction using metagenomic sequencing data.基于宏基因组测序数据的样本来源预测的有监督机器学习方法的系统评价。
Biol Direct. 2020 Dec 10;15(1):29. doi: 10.1186/s13062-020-00287-y.
8
Performance of Gut Microbiome as an Independent Diagnostic Tool for 20 Diseases: Cross-Cohort Validation of Machine-Learning Classifiers.肠道微生物组作为 20 种疾病独立诊断工具的性能:基于机器学习分类器的跨队列验证。
Gut Microbes. 2023 Jan-Dec;15(1):2205386. doi: 10.1080/19490976.2023.2205386.
9
A machine learning framework to determine geolocations from metagenomic profiling.基于宏基因组分析的地理位置确定机器学习框架。
Biol Direct. 2020 Nov 23;15(1):27. doi: 10.1186/s13062-020-00278-z.
10
Massive metagenomic data analysis using abundance-based machine learning.基于丰度的机器学习在海量宏基因组数据分析中的应用。
Biol Direct. 2019 Aug 1;14(1):12. doi: 10.1186/s13062-019-0242-0.

引用本文的文献

1
Development and validation of machine learning-based diagnostic models using blood transcriptomics for early childhood diabetes prediction.基于血液转录组学的机器学习诊断模型用于幼儿糖尿病预测的开发与验证。
Front Med (Lausanne). 2025 Jul 16;12:1636214. doi: 10.3389/fmed.2025.1636214. eCollection 2025.
2
An exciting future for microbial molecular biology and physiology.微生物分子生物学与生理学的激动人心的未来。
mBio. 2025 Aug 13;16(8):e0069425. doi: 10.1128/mbio.00694-25. Epub 2025 Jun 30.
3
MSFT-transformer: a multistage fusion tabular transformer for disease prediction using metagenomic data.

本文引用的文献

1
Microbiome meta-analysis and cross-disease comparison enabled by the SIAMCAT machine learning toolbox.通过 SIAMCAT 机器学习工具箱进行微生物组荟萃分析和跨疾病比较。
Genome Biol. 2021 Mar 30;22(1):93. doi: 10.1186/s13059-021-02306-1.
2
Statistical and Machine Learning Techniques in Human Microbiome Studies: Contemporary Challenges and Solutions.人类微生物组研究中的统计和机器学习技术:当代挑战与解决方案
Front Microbiol. 2021 Feb 22;12:635781. doi: 10.3389/fmicb.2021.635781. eCollection 2021.
3
Machine Learning Strategy for Gut Microbiome-Based Diagnostic Screening of Cardiovascular Disease.
微软变压器:一种用于使用宏基因组数据进行疾病预测的多级融合表格变压器。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf217.
4
Deep learning in microbiome analysis: a comprehensive review of neural network models.微生物组分析中的深度学习:神经网络模型综述
Front Microbiol. 2025 Jan 22;15:1516667. doi: 10.3389/fmicb.2024.1516667. eCollection 2024.
5
Techniques for learning and transferring knowledge for microbiome-based classification and prediction: review and assessment.基于微生物组的分类和预测的知识学习与转移技术:综述与评估
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf015.
6
Effects of snake fungal disease (ophidiomycosis) on the skin microbiome across two major experimental scales.蛇真菌病(蛇类霉菌病)在两个主要实验尺度上对皮肤微生物群的影响。
Conserv Biol. 2025 Apr;39(2):e14411. doi: 10.1111/cobi.14411. Epub 2024 Nov 12.
7
MicroHDF: predicting host phenotypes with metagenomic data using a deep forest-based framework.MicroHDF:基于深度森林框架利用宏基因组数据预测宿主表型。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae530.
8
Microbiota during pregnancy and early life: role in maternal-neonatal outcomes based on human evidence.孕期和生命早期的微生物群:基于人体证据的母婴结局作用。
Gut Microbes. 2024 Jan-Dec;16(1):2392009. doi: 10.1080/19490976.2024.2392009. Epub 2024 Aug 19.
9
Microbial influence on blood pressure: unraveling the complex relationship for health insights.微生物对血压的影响:揭示这种复杂关系以获取健康见解。
Microbiome Res Rep. 2024 Mar 18;3(2):22. doi: 10.20517/mrr.2023.73. eCollection 2024.
10
Intersecting Pathways in Bioinformatics and Translational Informatics: A One Health Perspective on Key Contributions and Future Directions.生物信息学和转化信息学的交汇途径:从“同一健康”视角看关键贡献和未来方向
Yearb Med Inform. 2023 Aug;32(1):99-103. doi: 10.1055/s-0043-1768745. Epub 2023 Dec 26.
基于肠道微生物组的心血管疾病诊断筛查的机器学习策略。
Hypertension. 2020 Nov;76(5):1555-1562. doi: 10.1161/HYPERTENSIONAHA.120.15885. Epub 2020 Sep 10.
4
PopPhy-CNN: A Phylogenetic Tree Embedded Architecture for Convolutional Neural Networks to Predict Host Phenotype From Metagenomic Data.PopPhy-CNN:一种将系统发生树嵌入到卷积神经网络中的架构,用于从宏基因组数据中预测宿主表型。
IEEE J Biomed Health Inform. 2020 Oct;24(10):2993-3001. doi: 10.1109/JBHI.2020.2993761. Epub 2020 May 11.
5
DeepMicro: deep representation learning for disease prediction based on microbiome data.深微:基于微生物组数据的疾病预测的深度学习表示。
Sci Rep. 2020 Apr 7;10(1):6026. doi: 10.1038/s41598-020-63159-5.
6
It's what's on the inside that counts: stress physiology and the bacterial microbiome of a wild urban mammal.内在才是关键:压力生理学与野生城市哺乳动物的细菌微生物组。
Proc Biol Sci. 2019 Oct 23;286(1913):20192111. doi: 10.1098/rspb.2019.2111.
7
Gut microbiome diversity is associated with sleep physiology in humans.肠道微生物多样性与人类的睡眠生理学有关。
PLoS One. 2019 Oct 7;14(10):e0222394. doi: 10.1371/journal.pone.0222394. eCollection 2019.
8
Multitable Methods for Microbiome Data Integration.微生物组数据整合的多表方法
Front Genet. 2019 Aug 28;10:627. doi: 10.3389/fgene.2019.00627. eCollection 2019.
9
Metagenomic and metabolomic analyses reveal distinct stage-specific phenotypes of the gut microbiota in colorectal cancer.宏基因组学和代谢组学分析揭示了结直肠癌肠道微生物群的不同阶段特异性表型。
Nat Med. 2019 Jun;25(6):968-976. doi: 10.1038/s41591-019-0458-7. Epub 2019 Jun 6.
10
The Battle Within: Interactions of Bacteriophages and Bacteria in the Gastrointestinal Tract.《体内之战:肠道噬菌体与细菌的相互作用》
Cell Host Microbe. 2019 Feb 13;25(2):210-218. doi: 10.1016/j.chom.2019.01.018.