• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用DEBIAS-M进行处理偏差校正可提高基于微生物组的预测模型的跨研究泛化能力。

Processing-bias correction with DEBIAS-M improves cross-study generalization of microbiome-based prediction models.

作者信息

Austin George I, Brown Kav Aya, ElNaggar Shahd, Park Heekuk, Biermann Jana, Uhlemann Anne-Catrin, Pe'er Itsik, Korem Tal

机构信息

Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA.

Program for Mathematical Genomics, Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA.

出版信息

Nat Microbiol. 2025 Apr;10(4):897-911. doi: 10.1038/s41564-025-01954-4. Epub 2025 Mar 27.

DOI:10.1038/s41564-025-01954-4
PMID:40148567
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12087262/
Abstract

Every step in common microbiome profiling protocols has variable efficiency for each microbe, for example, different DNA extraction efficiency for Gram-positive bacteria. These processing biases impede the identification of signals that are biologically interpretable and generalizable across studies. 'Batch-correction' methods have been used to address these issues computationally with some success, but they are largely non-interpretable and often require the use of an outcome variable in a manner that risks overfitting. We present DEBIAS-M (domain adaptation with phenotype estimation and batch integration across studies of the microbiome), an interpretable framework for inference and correction of processing bias, which facilitates domain adaptation in microbiome studies. DEBIAS-M learns bias-correction factors for each microbe in each batch that simultaneously minimize batch effects and maximize cross-study associations with phenotypes. Using diverse benchmarks including 16S rRNA and metagenomic sequencing, classification and regression, and a variety of clinical and molecular targets, we demonstrate that using DEBIAS-M improves cross-study prediction accuracy compared with commonly used batch-correction methods. Notably, we show that the inferred bias-correction factors are stable, interpretable and strongly associated with specific experimental protocols. Overall, we show that DEBIAS-M facilitates improved modelling of microbiome data and identification of interpretable signals that generalize across studies.

摘要

常见微生物组分析方案中的每一步对每种微生物的效率都有所不同,例如,革兰氏阳性菌的DNA提取效率就不同。这些处理偏差阻碍了对具有生物学可解释性且能在不同研究中通用的信号的识别。“批次校正”方法已被用于通过计算解决这些问题并取得了一些成功,但它们在很大程度上难以解释,并且通常需要以存在过度拟合风险的方式使用结果变量。我们提出了DEBIAS-M(微生物组跨研究的表型估计和批次整合的域适应),这是一个用于推断和校正处理偏差的可解释框架,它有助于微生物组研究中的域适应。DEBIAS-M为每个批次中的每种微生物学习偏差校正因子,同时最小化批次效应并最大化与表型的跨研究关联。使用包括16S rRNA和宏基因组测序、分类和回归以及各种临床和分子靶点在内的多种基准,我们证明与常用的批次校正方法相比,使用DEBIAS-M可提高跨研究预测准确性。值得注意的是,我们表明推断出的偏差校正因子是稳定的、可解释的,并且与特定的实验方案密切相关。总体而言,我们表明DEBIAS-M有助于改进微生物组数据建模,并识别在不同研究中通用的可解释信号。

相似文献

1
Processing-bias correction with DEBIAS-M improves cross-study generalization of microbiome-based prediction models.使用DEBIAS-M进行处理偏差校正可提高基于微生物组的预测模型的跨研究泛化能力。
Nat Microbiol. 2025 Apr;10(4):897-911. doi: 10.1038/s41564-025-01954-4. Epub 2025 Mar 27.
2
Processing-bias correction with DEBIAS-M improves cross-study generalization of microbiome-based prediction models.使用DEBIAS-M进行处理偏差校正可提高基于微生物组的预测模型的跨研究泛化能力。
bioRxiv. 2024 Feb 12:2024.02.09.579716. doi: 10.1101/2024.02.09.579716.
3
De-biasing microbiome sequencing data: bacterial morphology-based correction of extraction bias and correlates of chimera formation.微生物组测序数据的去偏倚:基于细菌形态学对提取偏差的校正及嵌合体形成的相关因素
Microbiome. 2025 Feb 4;13(1):38. doi: 10.1186/s40168-024-01998-4.
4
The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies.宏基因组学的真相:量化和抵消16S rRNA研究中的偏差
BMC Microbiol. 2015 Mar 21;15:66. doi: 10.1186/s12866-015-0351-6.
5
Inference of Environmental Factor-Microbe and Microbe-Microbe Associations from Metagenomic Data Using a Hierarchical Bayesian Statistical Model.基于分层贝叶斯统计模型从宏基因组数据中推断环境因子-微生物和微生物-微生物的关联。
Cell Syst. 2017 Jan 25;4(1):129-137.e5. doi: 10.1016/j.cels.2016.12.012.
6
A sensitivity analysis of methodological variables associated with microbiome measurements.与微生物组测量相关的方法学变量的敏感性分析。
Microbiol Spectr. 2025 Feb 4;13(2):e0069624. doi: 10.1128/spectrum.00696-24. Epub 2025 Jan 14.
7
Variability and bias in microbiome metagenomic sequencing: an interlaboratory study comparing experimental protocols.微生物组宏基因组测序中的变异性和偏差:比较实验方案的实验室间研究。
Sci Rep. 2024 Apr 29;14(1):9785. doi: 10.1038/s41598-024-57981-4.
8
Freshwater monitoring by nanopore sequencing.利用纳米孔测序进行淡水监测。
Elife. 2021 Jan 19;10:e61504. doi: 10.7554/eLife.61504.
9
High-throughput DNA extraction strategy for fecal microbiome studies.高通量粪便微生物组研究的 DNA 提取策略。
Microbiol Spectr. 2024 Jun 4;12(6):e0293223. doi: 10.1128/spectrum.02932-23. Epub 2024 May 15.
10
Large-scale benchmarking reveals false discoveries and count transformation sensitivity in 16S rRNA gene amplicon data analysis methods used in microbiome studies.大规模基准测试揭示了微生物组研究中使用的 16S rRNA 基因扩增子数据分析方法中的假发现和计数转换敏感性。
Microbiome. 2016 Nov 25;4(1):62. doi: 10.1186/s40168-016-0208-8.

引用本文的文献

1
Microbiome data integration via shared dictionary learning.通过共享字典学习进行微生物组数据整合。
Nat Commun. 2025 Sep 1;16(1):8147. doi: 10.1038/s41467-025-63425-y.
2
Compositional transformations can reasonably introduce phenotype-associated values into sparse features.成分转换可以合理地将与表型相关的值引入稀疏特征中。
mSystems. 2025 May 20;10(5):e0002125. doi: 10.1128/msystems.00021-25. Epub 2025 May 2.
3
Early prediction of preeclampsia using the first trimester vaginal microbiome.利用孕早期阴道微生物群对先兆子痫进行早期预测。

本文引用的文献

1
Compositional transformations can reasonably introduce phenotype-associated values into sparse features.成分转换可以合理地将与表型相关的值引入稀疏特征中。
mSystems. 2025 May 20;10(5):e0002125. doi: 10.1128/msystems.00021-25. Epub 2025 May 2.
2
Domain adaptation in small-scale and heterogeneous biological datasets.小规模和异构生物数据集中的域适应
Sci Adv. 2024 Dec 20;10(51):eadp6040. doi: 10.1126/sciadv.adp6040.
3
Microbiome preterm birth DREAM challenge: Crowdsourcing machine learning approaches to advance preterm birth research.
bioRxiv. 2024 Dec 2:2024.12.01.626267. doi: 10.1101/2024.12.01.626267.
微生物组早产 DREAM 挑战赛:众包机器学习方法以推进早产研究。
Cell Rep Med. 2024 Jan 16;5(1):101350. doi: 10.1016/j.xcrm.2023.101350. Epub 2023 Dec 21.
4
Major data analysis errors invalidate cancer microbiome findings.主要数据分析错误使癌症微生物组研究结果无效。
mBio. 2023 Oct 31;14(5):e0160723. doi: 10.1128/mbio.01607-23. Epub 2023 Oct 9.
5
Human microbiome myths and misconceptions.人类微生物组的误区和误解。
Nat Microbiol. 2023 Aug;8(8):1392-1396. doi: 10.1038/s41564-023-01426-7. Epub 2023 Jul 31.
6
Contamination source modeling with SCRuB improves cancer phenotype prediction from microbiome data.SCRuB 进行污染来源建模可提高基于微生物组数据的癌症表型预测能力。
Nat Biotechnol. 2023 Dec;41(12):1820-1828. doi: 10.1038/s41587-023-01696-w. Epub 2023 Mar 16.
7
PLSDA-batch: a multivariate framework to correct for batch effects in microbiome data.PLSDA-batch:一种用于校正微生物组数据中批次效应的多元框架。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbac622.
8
Preterm birth is associated with xenobiotics and predicted by the vaginal metabolome.早产与外源性化学物质有关,并可通过阴道代谢组学预测。
Nat Microbiol. 2023 Feb;8(2):246-259. doi: 10.1038/s41564-022-01293-8. Epub 2023 Jan 12.
9
Twenty-five years of Genomes OnLine Database (GOLD): data updates and new features in v.9.25 年的基因组在线数据库(GOLD):v.9 中的数据更新和新功能。
Nucleic Acids Res. 2023 Jan 6;51(D1):D957-D963. doi: 10.1093/nar/gkac974.
10
Population structure discovery in meta-analyzed microbial communities and inflammatory bowel disease using MMUPHin.使用 MMUPHin 发现元分析微生物群落和炎症性肠病中的种群结构。
Genome Biol. 2022 Oct 3;23(1):208. doi: 10.1186/s13059-022-02753-4.