• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过机器学习开发基于微生物组的疾病诊断分类器的最佳实践。

Best practices for developing microbiome-based disease diagnostic classifiers through machine learning.

作者信息

Li Peikun, Li Min, Chen Wei-Hua

机构信息

Key Laboratory of Molecular Biophysics of the Ministry of Education, Hubei Key Laboratory of Bioinformatics and Molecular Imaging, Center for Artificial Intelligence Biology, Department of Bioinformatics and Systems Biology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei, China.

School of Biological Science, Jining Medical University, Rizhao, China.

出版信息

Gut Microbes. 2025 Dec;17(1):2489074. doi: 10.1080/19490976.2025.2489074. Epub 2025 Apr 4.

DOI:10.1080/19490976.2025.2489074
PMID:40186338
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11980492/
Abstract

The human gut microbiome, crucial in various diseases, can be utilized to develop diagnostic models through machine learning (ML). The specific tools and parameters used in model construction such as data preprocessing, batch effect removal and modeling algorithms can impact model performance and generalizability. To establish an generally applicable workflow, we divided the ML process into three above-mentioned steps and optimized each sequentially using 83 gut microbiome cohorts across 20 diseases. We tested a total of 156 tool-parameter-algorithm combinations and benchmarked them according to internal- and external- AUCs. At the data preprocessing step, we identified four data preprocessing methods that performed well for regression-type algorithms and one method that excelled for non-regression-type algorithms. At the batch effect removal step, we identified the "ComBat" function from the R package as an effective batch effect removal method and compared the performance of various algorithms. Finally, at the ML algorithm selection step, we found that Ridge and Random Forest ranked the best. Our optimized work flow performed similarly comparing with previous exhaustive methods for disease-specific optimizations, thus is generally applicable and can provide a comprehensive guideline for constructing diagnostic models for a range of diseases, potentially serving as a powerful tool for future medical diagnostics.

摘要

人类肠道微生物群在多种疾病中起着关键作用,可通过机器学习(ML)用于开发诊断模型。模型构建中使用的特定工具和参数,如数据预处理、批次效应消除和建模算法,会影响模型性能和通用性。为了建立一个普遍适用的工作流程,我们将ML过程分为上述三个步骤,并使用来自20种疾病的83个肠道微生物群队列依次对每个步骤进行优化。我们总共测试了156种工具-参数-算法组合,并根据内部和外部AUC对它们进行基准测试。在数据预处理步骤中,我们确定了四种对回归型算法表现良好的数据预处理方法和一种对非回归型算法表现出色的方法。在批次效应消除步骤中,我们从R包中确定了“ComBat”函数作为一种有效的批次效应消除方法,并比较了各种算法的性能。最后,在ML算法选择步骤中,我们发现岭回归和随机森林排名最佳。我们优化的工作流程与之前针对特定疾病优化的详尽方法相比表现相似,因此具有普遍适用性,可以为构建一系列疾病的诊断模型提供全面的指导方针,有可能成为未来医学诊断的有力工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3173/11980492/e2f957caef77/KGMI_A_2489074_F0004_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3173/11980492/caf8baabf486/KGMI_A_2489074_F0001_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3173/11980492/a4568869fd29/KGMI_A_2489074_F0002_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3173/11980492/8503ac47d688/KGMI_A_2489074_F0003_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3173/11980492/e2f957caef77/KGMI_A_2489074_F0004_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3173/11980492/caf8baabf486/KGMI_A_2489074_F0001_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3173/11980492/a4568869fd29/KGMI_A_2489074_F0002_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3173/11980492/8503ac47d688/KGMI_A_2489074_F0003_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3173/11980492/e2f957caef77/KGMI_A_2489074_F0004_OC.jpg

相似文献

1
Best practices for developing microbiome-based disease diagnostic classifiers through machine learning.通过机器学习开发基于微生物组的疾病诊断分类器的最佳实践。
Gut Microbes. 2025 Dec;17(1):2489074. doi: 10.1080/19490976.2025.2489074. Epub 2025 Apr 4.
2
Comprehensive assessment of machine learning methods for diagnosing gastrointestinal diseases through whole metagenome sequencing data.通过全宏基因组测序数据对用于诊断胃肠道疾病的机器学习方法进行综合评估。
Gut Microbes. 2024 Jan-Dec;16(1):2375679. doi: 10.1080/19490976.2024.2375679. Epub 2024 Jul 7.
3
Benchmark of Data Processing Methods and Machine Learning Models for Gut Microbiome-Based Diagnosis of Inflammatory Bowel Disease.基于肠道微生物群的炎症性肠病诊断的数据处理方法和机器学习模型基准
Front Genet. 2022 Feb 14;13:784397. doi: 10.3389/fgene.2022.784397. eCollection 2022.
4
Performance of Gut Microbiome as an Independent Diagnostic Tool for 20 Diseases: Cross-Cohort Validation of Machine-Learning Classifiers.肠道微生物组作为 20 种疾病独立诊断工具的性能:基于机器学习分类器的跨队列验证。
Gut Microbes. 2023 Jan-Dec;15(1):2205386. doi: 10.1080/19490976.2023.2205386.
5
Gut microbiome-based machine learning for diagnostic prediction of liver fibrosis and cirrhosis: a systematic review and meta-analysis.基于肠道微生物组的机器学习用于诊断预测肝纤维化和肝硬化:系统评价和荟萃分析。
BMC Med Inform Decis Mak. 2023 Dec 19;23(1):294. doi: 10.1186/s12911-023-02402-1.
6
Gut microbiota-based machine-learning signature for the diagnosis of alcohol-associated and metabolic dysfunction-associated steatotic liver disease.基于肠道微生物组的机器学习特征用于诊断酒精相关和代谢功能障碍相关脂肪性肝病。
Sci Rep. 2024 Jul 12;14(1):16122. doi: 10.1038/s41598-024-60768-2.
7
Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers.机器学习算法在(放化疗)治疗结果预测中的应用:分类器的实证比较。
Med Phys. 2018 Jul;45(7):3449-3459. doi: 10.1002/mp.12967. Epub 2018 Jun 13.
8
Highly sensitive detection platform-based diagnosis of oesophageal squamous cell carcinoma in China: a multicentre, case-control, diagnostic study.基于高灵敏度检测平台的中国食管鳞状细胞癌诊断:一项多中心、病例对照诊断研究。
Lancet Digit Health. 2024 Oct;6(10):e705-e717. doi: 10.1016/S2589-7500(24)00153-5.
9
The Prediction of Sinter Drums Strength Using Hybrid Machine Learning Algorithms.基于混合机器学习算法的烧结矿鼓强度预测
Comput Intell Neurosci. 2022 Jul 7;2022:4790736. doi: 10.1155/2022/4790736. eCollection 2022.
10
Machine Learning Strategy for Gut Microbiome-Based Diagnostic Screening of Cardiovascular Disease.基于肠道微生物组的心血管疾病诊断筛查的机器学习策略。
Hypertension. 2020 Nov;76(5):1555-1562. doi: 10.1161/HYPERTENSIONAHA.120.15885. Epub 2020 Sep 10.

本文引用的文献

1
Unveiling the Power of Gut Microbiome in Predicting Neoadjuvant Immunochemotherapy Responses in Esophageal Squamous Cell Carcinoma.揭示肠道微生物群在预测食管鳞状细胞癌新辅助免疫化疗反应中的作用
Research (Wash D C). 2024 Nov 14;7:0529. doi: 10.34133/research.0529. eCollection 2024.
2
Comprehensive assessment of machine learning methods for diagnosing gastrointestinal diseases through whole metagenome sequencing data.通过全宏基因组测序数据对用于诊断胃肠道疾病的机器学习方法进行综合评估。
Gut Microbes. 2024 Jan-Dec;16(1):2375679. doi: 10.1080/19490976.2024.2375679. Epub 2024 Jul 7.
3
Gut microbiome features and metabolites in non-alcoholic fatty liver disease among community-dwelling middle-aged and older adults.
社区中老年人群非酒精性脂肪性肝病的肠道微生物群特征及代谢产物。
BMC Med. 2024 Mar 7;22(1):104. doi: 10.1186/s12916-024-03317-y.
4
The impact of gut microbiome enterotypes on ulcerative colitis: identifying key bacterial species and revealing species co-occurrence networks using machine learning.肠道微生物组 enterotypes 对溃疡性结肠炎的影响:使用机器学习识别关键细菌物种并揭示物种共存网络。
Gut Microbes. 2024 Jan-Dec;16(1):2292254. doi: 10.1080/19490976.2023.2292254. Epub 2023 Dec 20.
5
Microbiota in Autism Spectrum Disorder: A Systematic Review.自闭症谱系障碍中的微生物群:系统评价。
Int J Mol Sci. 2023 Nov 23;24(23):16660. doi: 10.3390/ijms242316660.
6
Machine learning strategy for identifying altered gut microbiomes for diagnostic screening in myasthenia gravis.用于识别重症肌无力诊断筛查中肠道微生物群改变的机器学习策略
Front Microbiol. 2023 Sep 27;14:1227300. doi: 10.3389/fmicb.2023.1227300. eCollection 2023.
7
Systematic review: Autism spectrum disorder and the gut microbiota.系统评价:自闭症谱系障碍与肠道微生物群。
Acta Psychiatr Scand. 2023 Sep;148(3):242-254. doi: 10.1111/acps.13587. Epub 2023 Jul 3.
8
Performance of Gut Microbiome as an Independent Diagnostic Tool for 20 Diseases: Cross-Cohort Validation of Machine-Learning Classifiers.肠道微生物组作为 20 种疾病独立诊断工具的性能:基于机器学习分类器的跨队列验证。
Gut Microbes. 2023 Jan-Dec;15(1):2205386. doi: 10.1080/19490976.2023.2205386.
9
Extending and improving metagenomic taxonomic profiling with uncharacterized species using MetaPhlAn 4.利用 MetaPhlAn 4 对未鉴定物种进行宏基因组分类分析的扩展和改进。
Nat Biotechnol. 2023 Nov;41(11):1633-1644. doi: 10.1038/s41587-023-01688-w. Epub 2023 Feb 23.
10
A Systematic Review of Mixed Studies Exploring the Effects of Probiotics on Gut-Microbiome to Modulate Therapy in Children With Autism Spectrum Disorder.一项关于混合研究的系统评价,探讨益生菌对肠道微生物群的影响以调节自闭症谱系障碍儿童的治疗。
Cureus. 2022 Dec 8;14(12):e32313. doi: 10.7759/cureus.32313. eCollection 2022 Dec.