• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于基因的微生物组表示增强了宿主表型分类。

Gene-based microbiome representation enhances host phenotype classification.

机构信息

Centre Nutrition, Santé et Société (NUTRISS) - Institut sur la Nutrition et les Aliments Fonctionnels (INAF), Université Laval , Québec, Canada.

Canada Research Excellence Chair on the Microbiome-Endocannabinoidome Axis in Metabolic Health (CERC-MEND) , Quebec City, Quebec, Canada.

出版信息

mSystems. 2023 Aug 31;8(4):e0053123. doi: 10.1128/msystems.00531-23. Epub 2023 Jul 5.

DOI:10.1128/msystems.00531-23
PMID:37404032
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10469787/
Abstract

With the concomitant advances in both the microbiome and machine learning fields, the gut microbiome has become of great interest for the potential discovery of biomarkers to be used in the classification of the host health status. Shotgun metagenomics data derived from the human microbiome is composed of a high-dimensional set of microbial features. The use of such complex data for the modeling of host-microbiome interactions remains a challenge as retaining content yields a highly granular set of microbial features. In this study, we compared the prediction performances of machine learning approaches according to different types of data representations derived from shotgun metagenomics. These representations include commonly used taxonomic and functional profiles and the more granular gene cluster approach. For the five case-control datasets used in this study (Type 2 diabetes, obesity, liver cirrhosis, colorectal cancer, and inflammatory bowel disease), gene-based approaches, whether used alone or in combination with reference-based data types, allowed improved or similar classification performances as the taxonomic and functional profiles. In addition, we show that using subsets of gene families from specific functional categories of genes highlight the importance of these functions on the host phenotype. This study demonstrates that both reference-free microbiome representations and curated metagenomic annotations can provide relevant representations for machine learning based on metagenomic data. IMPORTANCE Data representation is an essential part of machine learning performance when using metagenomic data. In this work, we show that different microbiome representations provide varied host phenotype classification performance depending on the dataset. In classification tasks, untargeted microbiome gene content can provide similar or improved classification compared to taxonomical profiling. Feature selection based on biological function also improves classification performance for some pathologies. Function-based feature selection combined with interpretable machine learning algorithms can generate new hypotheses that can potentially be assayed mechanistically. This work thus proposes new approaches to represent microbiome data for machine learning that can potentiate the findings associated with metagenomic data.

摘要

随着微生物组学和机器学习领域的共同进步,肠道微生物组成为发现用于宿主健康状况分类的生物标志物的热点。源自人类微生物组的宏基因组学数据由一组高维微生物特征组成。由于保留内容会产生高度细化的微生物特征集,因此使用此类复杂数据来模拟宿主-微生物组相互作用仍然是一个挑战。在这项研究中,我们根据源自宏基因组学的不同类型数据表示形式比较了机器学习方法的预测性能。这些表示形式包括常用的分类和功能谱以及更细粒度的基因簇方法。对于本研究中使用的五个病例对照数据集(2 型糖尿病、肥胖症、肝硬化、结直肠癌和炎症性肠病),基于基因的方法,无论是单独使用还是与基于参考的数据类型结合使用,都允许改进或类似的分类性能,与分类和功能谱相同。此外,我们表明,使用特定功能类别基因的基因家族子集突出了这些功能对宿主表型的重要性。这项研究表明,参考免费的微生物组表示形式和经过策展的宏基因组注释都可以为基于宏基因组数据的机器学习提供相关表示形式。

重要性 当使用宏基因组数据时,数据表示是机器学习性能的重要组成部分。在这项工作中,我们表明,不同的微生物组表示形式根据数据集提供不同的宿主表型分类性能。在分类任务中,与分类学分析相比,无目标的微生物组基因含量可以提供相似或改善的分类。基于生物学功能的特征选择也可以提高某些病理的分类性能。功能基特征选择与可解释的机器学习算法相结合,可以生成新的假说,这些假说可能具有潜在的机制检验能力。因此,这项工作提出了用于机器学习的微生物组数据表示的新方法,可以增强与宏基因组数据相关的发现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2a9/10469787/76a9daa1faf0/msystems.00531-23.f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2a9/10469787/d5aa0ea4ab04/msystems.00531-23.f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2a9/10469787/f71483fb1297/msystems.00531-23.f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2a9/10469787/3c586d435414/msystems.00531-23.f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2a9/10469787/bab008f2f1f5/msystems.00531-23.f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2a9/10469787/76a9daa1faf0/msystems.00531-23.f005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2a9/10469787/d5aa0ea4ab04/msystems.00531-23.f001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2a9/10469787/f71483fb1297/msystems.00531-23.f002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2a9/10469787/3c586d435414/msystems.00531-23.f003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2a9/10469787/bab008f2f1f5/msystems.00531-23.f004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2a9/10469787/76a9daa1faf0/msystems.00531-23.f005.jpg

相似文献

1
Gene-based microbiome representation enhances host phenotype classification.基于基因的微生物组表示增强了宿主表型分类。
mSystems. 2023 Aug 31;8(4):e0053123. doi: 10.1128/msystems.00531-23. Epub 2023 Jul 5.
2
Machine Learning Meta-analysis of Large Metagenomic Datasets: Tools and Biological Insights.大型宏基因组数据集的机器学习荟萃分析:工具与生物学见解
PLoS Comput Biol. 2016 Jul 11;12(7):e1004977. doi: 10.1371/journal.pcbi.1004977. eCollection 2016 Jul.
3
Comprehensive Functional Annotation of Metagenomes and Microbial Genomes Using a Deep Learning-Based Method.基于深度学习的宏基因组和微生物组综合功能注释。
mSystems. 2023 Apr 27;8(2):e0117822. doi: 10.1128/msystems.01178-22. Epub 2023 Mar 7.
4
Inflammatory bowel disease biomarkers of human gut microbiota selected via different feature selection methods.基于不同特征选择方法筛选出的人类肠道微生物组炎症性肠病生物标志物。
PeerJ. 2022 Apr 25;10:e13205. doi: 10.7717/peerj.13205. eCollection 2022.
5
Systematic evaluation of supervised machine learning for sample origin prediction using metagenomic sequencing data.基于宏基因组测序数据的样本来源预测的有监督机器学习方法的系统评价。
Biol Direct. 2020 Dec 10;15(1):29. doi: 10.1186/s13062-020-00287-y.
6
Functional dynamics of bacterial species in the mouse gut microbiome revealed by metagenomic and metatranscriptomic analyses.基于宏基因组和宏转录组分析揭示的小鼠肠道微生物组中细菌物种的功能动态。
PLoS One. 2020 Jan 24;15(1):e0227886. doi: 10.1371/journal.pone.0227886. eCollection 2020.
7
Automatic disease prediction from human gut metagenomic data using boosting GraphSAGE.基于提升图抽样的人类肠道宏基因组数据自动疾病预测。
BMC Bioinformatics. 2023 Mar 31;24(1):126. doi: 10.1186/s12859-023-05251-x.
8
Microbiome-based disease prediction with multimodal variational information bottlenecks.基于微生物组的多模态变分信息瓶颈疾病预测。
PLoS Comput Biol. 2022 Apr 11;18(4):e1010050. doi: 10.1371/journal.pcbi.1010050. eCollection 2022 Apr.
9
CViewer: a Java-based statistical framework for integration of shotgun metagenomics with other omics datasets.CViewer:一个基于 Java 的统计框架,用于将鸟枪法宏基因组学与其他组学数据集进行整合。
Microbiome. 2024 Jun 29;12(1):117. doi: 10.1186/s40168-024-01834-9.
10
Metagenomic discovery of microbial eukaryotes in stool microbiomes.粪便微生物组中微生物真核生物的宏基因组学发现。
mBio. 2024 Oct 16;15(10):e0206324. doi: 10.1128/mbio.02063-24. Epub 2024 Aug 29.

引用本文的文献

1
Predictive modeling of colorectal cancer using exhaustive analysis of microbiome information layers available from public metagenomic data.利用公开宏基因组数据中可用的微生物组信息层进行详尽分析对结直肠癌进行预测建模。
Front Microbiol. 2024 Aug 26;15:1426407. doi: 10.3389/fmicb.2024.1426407. eCollection 2024.
2
microBiomeGSM: the identification of taxonomic biomarkers from metagenomic data using grouping, scoring and modeling (G-S-M) approach.微生物群落GSM:使用分组、评分和建模(G-S-M)方法从宏基因组数据中识别分类学生物标志物。
Front Microbiol. 2023 Nov 22;14:1264941. doi: 10.3389/fmicb.2023.1264941. eCollection 2023.

本文引用的文献

1
The diet rapidly and differentially affects the gut microbiota and host lipid mediators in a healthy population.这种饮食方式在健康人群中能快速且差异化地影响肠道微生物群和宿主脂质介质。
Microbiome. 2023 Feb 11;11(1):26. doi: 10.1186/s40168-023-01469-2.
2
Influence of diet on acute endocannabinoidome mediator levels post exercise in active women, a crossover randomized study.饮食对运动后活跃女性急性内源性大麻素介质水平的影响:一项交叉随机研究。
Sci Rep. 2022 May 20;12(1):8568. doi: 10.1038/s41598-022-10757-0.
3
Host phenotype classification from human microbiome data is mainly driven by the presence of microbial taxa.
从人类微生物组数据中进行宿主表型分类主要受微生物分类群的存在驱动。
PLoS Comput Biol. 2022 Apr 21;18(4):e1010066. doi: 10.1371/journal.pcbi.1010066. eCollection 2022 Apr.
4
Microbiome-based disease prediction with multimodal variational information bottlenecks.基于微生物组的多模态变分信息瓶颈疾病预测。
PLoS Comput Biol. 2022 Apr 11;18(4):e1010050. doi: 10.1371/journal.pcbi.1010050. eCollection 2022 Apr.
5
Human disease prediction from microbiome data by multiple feature fusion and deep learning.通过多特征融合和深度学习从微生物组数据预测人类疾病
iScience. 2022 Mar 16;25(4):104081. doi: 10.1016/j.isci.2022.104081. eCollection 2022 Apr 15.
6
Gut microbiome-derived glycine lipids are diet-dependent modulators of hepatic injury and atherosclerosis.肠道微生物组衍生的甘油脂是饮食依赖性肝损伤和动脉粥样硬化的调节剂。
J Lipid Res. 2022 Apr;63(4):100192. doi: 10.1016/j.jlr.2022.100192. Epub 2022 Mar 10.
7
A specific microbiota signature is associated to various degrees of ulcerative colitis as assessed by a machine learning approach.通过机器学习方法评估,特定的微生物群特征与溃疡性结肠炎的不同严重程度相关。
Gut Microbes. 2022 Jan-Dec;14(1):2028366. doi: 10.1080/19490976.2022.2028366.
8
Multimodal deep learning applied to classify healthy and disease states of human microbiome.多模态深度学习在人类微生物组健康和疾病状态分类中的应用。
Sci Rep. 2022 Jan 17;12(1):824. doi: 10.1038/s41598-022-04773-3.
9
A Goldilocks Principle for the Gut Microbiome: Taxonomic Resolution Matters for Microbiome-Based Classification of Colorectal Cancer.肠道微生物组的金发姑娘原则:分类分辨率对于基于微生物组的结直肠癌分类很重要。
mBio. 2022 Feb 22;13(1):e0316121. doi: 10.1128/mbio.03161-21. Epub 2022 Jan 11.
10
Determine independent gut microbiota-diseases association by eliminating the effects of human lifestyle factors.通过消除人类生活方式因素的影响来确定独立的肠道微生物群-疾病关联。
BMC Microbiol. 2022 Jan 3;22(1):4. doi: 10.1186/s12866-021-02414-9.