• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于纵向微生物组的可解释机器学习用于在疾病结局早期预测中识别随时间变化的生物标志物。

Longitudinal Microbiome-based Interpretable Machine Learning for Identification of Time-Varying Biomarkers in Early Prediction of Disease Outcomes.

作者信息

Dai Yifan, Qian Yunzhi, Qu Yixiang, Guan Wyliena, Xie Jialiu, Wang Duan, Butler Catherine, Dashper Stuart, Carroll Ian, Divaris Kimon, Liu Yufeng, Wu Di

机构信息

Department of Biostatistics, Gillings School of Global Public Health at University of North Carolina at Chapel Hill.

Department of Nutrition, Gillings School of Global Public Health at University of North Carolina at Chapel Hill.

出版信息

bioRxiv. 2024 Nov 20:2024.10.18.619118. doi: 10.1101/2024.10.18.619118.

DOI:10.1101/2024.10.18.619118
PMID:39605360
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11601495/
Abstract

Information generated from longitudinally-sampled microbial data has the potential to illuminate important aspects of development and progression for many human conditions and diseases. Identifying microbial biomarkers and their time-varying effects can not only advance our understanding of pathogenetic mechanisms, but also facilitate early diagnosis and guide optimal timing of interventions. However, longitudinal predictive modeling of highly noisy and dynamic microbial data (e.g., metagenomics) poses analytical challenges. To overcome these challenges, we introduce a robust and interpretable machine-learning-based longitudinal microbiome analysis framework, LP-Micro, that encompasses: (i) longitudinal microbial feature screening via a polynomial group lasso, (ii) disease outcome prediction implemented via machine learning methods (e.g., XGBoost, deep neural networks), and (iii) interpretable association testing between time points, microbial features, and disease outcomes via permutation feature importance. We demonstrate in simulations that LP-Micro can not only identify incident disease-related microbiome taxa but also offers improved prediction accuracy compared to existing approaches. Applications of LP-Micro in two longitudinal microbiome studies with clinical outcomes of childhood dental disease and weight loss following bariatric surgery yield consistently high prediction accuracy. The identified critical early predictive time points are informative and aligned with clinical expectations.

摘要

从纵向采样的微生物数据中生成的信息有可能揭示许多人类疾病和病症发展与进展的重要方面。识别微生物生物标志物及其随时间变化的影响,不仅可以促进我们对致病机制的理解,还能有助于早期诊断并指导干预的最佳时机。然而,对高度嘈杂且动态的微生物数据(例如宏基因组学数据)进行纵向预测建模带来了分析挑战。为了克服这些挑战,我们引入了一个强大且可解释的基于机器学习的纵向微生物组分析框架LP-Micro,该框架包括:(i)通过多项式组套索进行纵向微生物特征筛选;(ii)通过机器学习方法(例如XGBoost、深度神经网络)实现疾病结局预测;以及(iii)通过排列特征重要性对时间点、微生物特征和疾病结局之间进行可解释的关联测试。我们在模拟中证明,LP-Micro不仅可以识别与疾病发生相关的微生物组分类群,而且与现有方法相比,还能提供更高的预测准确性。LP-Micro在两项纵向微生物组研究中的应用,一项研究儿童牙科疾病的临床结局,另一项研究减肥手术后的体重减轻情况,均产生了始终如一的高预测准确性。所确定的关键早期预测时间点具有参考价值,并且与临床预期相符。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d5b/11601495/7da3edea4c14/nihpp-2024.10.18.619118v2-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d5b/11601495/87d1ddcc9a1e/nihpp-2024.10.18.619118v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d5b/11601495/78620102163f/nihpp-2024.10.18.619118v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d5b/11601495/214ae5b45cac/nihpp-2024.10.18.619118v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d5b/11601495/52937b2cca1a/nihpp-2024.10.18.619118v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d5b/11601495/e5c55f01e241/nihpp-2024.10.18.619118v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d5b/11601495/765f92567481/nihpp-2024.10.18.619118v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d5b/11601495/7da3edea4c14/nihpp-2024.10.18.619118v2-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d5b/11601495/87d1ddcc9a1e/nihpp-2024.10.18.619118v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d5b/11601495/78620102163f/nihpp-2024.10.18.619118v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d5b/11601495/214ae5b45cac/nihpp-2024.10.18.619118v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d5b/11601495/52937b2cca1a/nihpp-2024.10.18.619118v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d5b/11601495/e5c55f01e241/nihpp-2024.10.18.619118v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d5b/11601495/765f92567481/nihpp-2024.10.18.619118v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d5b/11601495/7da3edea4c14/nihpp-2024.10.18.619118v2-f0007.jpg

相似文献

1
Longitudinal Microbiome-based Interpretable Machine Learning for Identification of Time-Varying Biomarkers in Early Prediction of Disease Outcomes.基于纵向微生物组的可解释机器学习用于在疾病结局早期预测中识别随时间变化的生物标志物。
bioRxiv. 2024 Nov 20:2024.10.18.619118. doi: 10.1101/2024.10.18.619118.
2
A deep learning feature importance test framework for integrating informative high-dimensional biomarkers to improve disease outcome prediction.一种用于整合信息丰富的高维生物标志物以改善疾病预后预测的深度学习特征重要性测试框架。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae709.
3
High-dimensional biomarker identification for interpretable disease prediction via machine learning models.通过机器学习模型进行可解释疾病预测的高维生物标志物识别
Bioinformatics. 2025 May 6;41(5). doi: 10.1093/bioinformatics/btaf266.
4
Robust prediction of colorectal cancer via gut microbiome 16S rRNA sequencing data.通过肠道微生物组 16S rRNA 测序数据进行稳健的结直肠癌预测。
J Med Microbiol. 2024 Oct;73(10). doi: 10.1099/jmm.0.001903.
5
High-dimensional Biomarker Identification for Scalable and Interpretable Disease Prediction via Machine Learning Models.通过机器学习模型进行可扩展且可解释的疾病预测的高维生物标志物识别
bioRxiv. 2024 Oct 7:2024.10.04.616748. doi: 10.1101/2024.10.04.616748.
6
MDITRE: Scalable and Interpretable Machine Learning for Predicting Host Status from Temporal Microbiome Dynamics.MDITRE:基于时间微生物组动态预测宿主状态的可扩展和可解释的机器学习。
mSystems. 2022 Oct 26;7(5):e0013222. doi: 10.1128/msystems.00132-22. Epub 2022 Sep 7.
7
Machine Learning Reveals Time-Varying Microbial Predictors with Complex Effects on Glucose Regulation.机器学习揭示对血糖调节具有复杂影响的随时间变化的微生物预测因子。
mSystems. 2021 Feb 16;6(1):e01191-20. doi: 10.1128/mSystems.01191-20.
8
An Interpretable Longitudinal Preeclampsia Risk Prediction Using Machine Learning.一种使用机器学习的可解释性纵向子痫前期风险预测
medRxiv. 2023 Aug 16:2023.08.16.23293946. doi: 10.1101/2023.08.16.23293946.
9
Interpretable and accurate prediction models for metagenomics data.可解释且准确的宏基因组学数据预测模型。
Gigascience. 2020 Mar 1;9(3). doi: 10.1093/gigascience/giaa010.
10
The Impact of Time Horizon on Classification Accuracy: Application of Machine Learning to Prediction of Incident Coronary Heart Disease.时间范围对分类准确性的影响:机器学习在预测冠心病发病中的应用。
JMIR Cardio. 2022 Nov 2;6(2):e38040. doi: 10.2196/38040.

引用本文的文献

1
Cellular mechanical memory: a potential tool for mesenchymal stem cell-based therapy.细胞机械记忆:基于间充质干细胞治疗的潜在工具。
Stem Cell Res Ther. 2025 Mar 31;16(1):159. doi: 10.1186/s13287-025-04249-x.

本文引用的文献

1
Scalable log-ratio lasso regression for enhanced microbial feature selection with FLORAL.可扩展对数比套索回归增强微生物特征选择的 FLORAL。
Cell Rep Methods. 2024 Nov 18;4(11):100899. doi: 10.1016/j.crmeth.2024.100899. Epub 2024 Nov 7.
2
Effect of bottled fluoridated water to prevent dental caries in primary teeth: study protocol for a phase 2 parallel-group 3.5-year randomized controlled clinical trial (waterBEST).瓶装氟化水预防乳牙龋齿的效果:一项为期 3.5 年的 2 期平行组随机对照临床试验(waterBEST)研究方案。
Trials. 2024 Mar 5;25(1):167. doi: 10.1186/s13063-024-08000-4.
3
Methodological Considerations in Longitudinal Analyses of Microbiome Data: A Comprehensive Review.
纵向分析微生物组数据的方法学考虑:全面综述。
Genes (Basel). 2023 Dec 28;15(1):0. doi: 10.3390/genes15010051.
4
Machine learning-based feature selection to search stable microbial biomarkers: application to inflammatory bowel disease.基于机器学习的特征选择搜索稳定的微生物生物标志物:在炎症性肠病中的应用。
Gigascience. 2022 Dec 28;12. doi: 10.1093/gigascience/giad083. Epub 2023 Oct 26.
5
Generalized Sparse Additive Models.广义稀疏加性模型
J Mach Learn Res. 2022 Jan-Dec;23.
6
Selenomonas sputigena acts as a pathobiont mediating spatial structure and biofilm virulence in early childhood caries.唾液链球菌(Selenomonas sputigena)作为一种条件致病菌,介导婴幼儿龋病的空间结构和生物膜毒力。
Nat Commun. 2023 May 22;14(1):2919. doi: 10.1038/s41467-023-38346-3.
7
Schaalia (Formerly Actinomyces) turicensis Infection Following Open Rotator Cuff Repair.开放性肩袖修复术后突尼斯沙利亚菌(原放线菌属)感染
Cureus. 2023 Jan 26;15(1):e34242. doi: 10.7759/cureus.34242. eCollection 2023 Jan.
8
Gut Microbiota and Metabolites May Play a Crucial Role in Sea Cucumber Aestivation.肠道微生物群和代谢产物可能在海参夏眠中起关键作用。
Microorganisms. 2023 Feb 7;11(2):416. doi: 10.3390/microorganisms11020416.
9
Comparison of weight loss outcomes between Roux-en-Y gastric bypass and sleeve gastrectomy in a racially mixed urban patient population.在一个种族混合的城市患者人群中比较 Roux-en-Y 胃旁路术和袖状胃切除术的减肥效果。
Surg Obes Relat Dis. 2022 Oct;18(10):1218-1227. doi: 10.1016/j.soard.2022.05.025. Epub 2022 Jun 1.
10
The microbiome and gut homeostasis.微生物组与肠道内稳态。
Science. 2022 Jul;377(6601):eabp9960. doi: 10.1126/science.abp9960. Epub 2022 Jul 1.