mbDecoda：一种用于微生物组调查的成分数据分析的偏差校正方法。

mbDecoda: a debiased approach to compositional data analysis for microbiome surveys.

作者信息

Zong Yuxuan, Zhao Hongyu, Wang Tao

机构信息

Department of Bioinformatics and Biostatistics, Shanghai Jiao Tong University, Shanghai, China.

SJTU-Yale Joint Center of Biostatistics and Data Science, Shanghai Jiao Tong University, Shanghai, China.

出版信息

Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae205.

DOI:10.1093/bib/bbae205

PMID:38701410

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11066923/

Abstract

Potentially pathogenic or probiotic microbes can be identified by comparing their abundance levels between healthy and diseased populations, or more broadly, by linking microbiome composition with clinical phenotypes or environmental factors. However, in microbiome studies, feature tables provide relative rather than absolute abundance of each feature in each sample, as the microbial loads of the samples and the ratios of sequencing depth to microbial load are both unknown and subject to considerable variation. Moreover, microbiome abundance data are count-valued, often over-dispersed and contain a substantial proportion of zeros. To carry out differential abundance analysis while addressing these challenges, we introduce mbDecoda, a model-based approach for debiased analysis of sparse compositions of microbiomes. mbDecoda employs a zero-inflated negative binomial model, linking mean abundance to the variable of interest through a log link function, and it accommodates the adjustment for confounding factors. To efficiently obtain maximum likelihood estimates of model parameters, an Expectation Maximization algorithm is developed. A minimum coverage interval approach is then proposed to rectify compositional bias, enabling accurate and reliable absolute abundance analysis. Through extensive simulation studies and analysis of real-world microbiome datasets, we demonstrate that mbDecoda compares favorably with state-of-the-art methods in terms of effectiveness, robustness and reproducibility.

摘要

通过比较健康人群和患病群体之间潜在致病或益生菌微生物的丰度水平，或者更广泛地说，通过将微生物组组成与临床表型或环境因素联系起来，可以识别这些微生物。然而，在微生物组研究中，特征表提供的是每个样本中每个特征的相对丰度而非绝对丰度，因为样本的微生物载量以及测序深度与微生物载量的比率均未知且变化很大。此外，微生物组丰度数据是计数值，通常过度分散且包含相当比例的零值。为了在应对这些挑战的同时进行差异丰度分析，我们引入了mbDecoda，这是一种基于模型的方法，用于对微生物组的稀疏组成进行偏差校正分析。mbDecoda采用零膨胀负二项式模型，通过对数链接函数将平均丰度与感兴趣的变量联系起来，并对混杂因素进行调整。为了有效地获得模型参数的最大似然估计，开发了一种期望最大化算法。然后提出了一种最小覆盖区间方法来纠正组成偏差，从而实现准确可靠的绝对丰度分析。通过广泛的模拟研究和对真实世界微生物组数据集的分析，我们证明mbDecoda在有效性、稳健性和可重复性方面优于现有方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/711d/11066923/34eba2fbb0eb/bbae205f1.jpg

相似文献

mbDecoda: a debiased approach to compositional data analysis for microbiome surveys.mbDecoda：一种用于微生物组调查的成分数据分析的偏差校正方法。

Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae205.

Zero-inflated generalized Dirichlet multinomial regression model for microbiome compositional data analysis.用于微生物组组成数据分析的零膨胀广义狄利克雷多项回归模型。

Biostatistics. 2019 Oct 1;20(4):698-713. doi: 10.1093/biostatistics/kxy025.

LOCOM: A logistic regression model for testing differential abundance in compositional microbiome data with false discovery rate control.LOCOM：一种用于检验微生物组数据中丰度差异的逻辑回归模型，具有错误发现率控制。

Proc Natl Acad Sci U S A. 2022 Jul 26;119(30):e2122788119. doi: 10.1073/pnas.2122788119. Epub 2022 Jul 22.

Analyzing the overall effects of the microbiome abundance data with a Bayesian predictive value approach.采用贝叶斯预测价值方法分析微生物组丰度数据的总体影响。

Stat Methods Med Res. 2022 Oct;31(10):1992-2003. doi: 10.1177/09622802221107106. Epub 2022 Jun 12.

Zero-Inflated gaussian mixed models for analyzing longitudinal microbiome data.用于分析纵向微生物组数据的零膨胀高斯混合模型

PLoS One. 2020 Nov 9;15(11):e0242073. doi: 10.1371/journal.pone.0242073. eCollection 2020.

Transformation and differential abundance analysis of microbiome data incorporating phylogeny.整合系统发育信息的微生物组数据的转化和差异丰度分析。

Bioinformatics. 2021 Dec 11;37(24):4652-4660. doi: 10.1093/bioinformatics/btab543.

MarZIC: A Marginal Mediation Model for Zero-Inflated Compositional Mediators with Applications to Microbiome Data.MarZIC：一种用于零膨胀成分中介的边缘中介模型及其在微生物组数据中的应用。

Genes (Basel). 2022 Jun 11;13(6):1049. doi: 10.3390/genes13061049.

mbDenoise: microbiome data denoising using zero-inflated probabilistic principal components analysis.mbDenoise：使用零膨胀概率主成分分析的微生物组数据去噪

Genome Biol. 2022 Apr 14;23(1):94. doi: 10.1186/s13059-022-02657-3.

An assessment of compositional methods for the analysis of DNA methylation-based deconvolution estimates.基于 DNA 甲基化去卷积估计的组成方法分析评估。

Epigenomics. 2024;16(15-16):1067-1080. doi: 10.1080/17501911.2024.2379242. Epub 2024 Aug 2.

A strategy for differential abundance analysis of sparse microbiome data with group-wise structured zeros.一种针对具有组间结构零的稀疏微生物组数据的差异丰度分析策略。

Sci Rep. 2024 May 30;14(1):12433. doi: 10.1038/s41598-024-62437-w.

本文引用的文献

Modulating the Human Gut Microbiota through Hypocaloric Balanced Diets: An Effective Approach for Managing Obesity.通过低热量均衡饮食调节人体肠道微生物群：管理肥胖的有效方法。

Nutrients. 2023 Jul 11;15(14):3101. doi: 10.3390/nu15143101.

Alterations in Microbiota and Metabolites Related to Spontaneous Diabetes and Pre-Diabetes in Rhesus Macaques.恒河猴自发性糖尿病及糖尿病前期相关的微生物群和代谢物的改变。

Genes (Basel). 2022 Aug 24;13(9):1513. doi: 10.3390/genes13091513.

Proc Natl Acad Sci U S A. 2022 Jul 26;119(30):e2122788119. doi: 10.1073/pnas.2122788119. Epub 2022 Jul 22.

Changes in the Mucosa-Associated Microbiome and Transcriptome across Gut Segments Are Associated with Obesity in a Metabolic Syndrome Porcine Model.黏膜相关微生物组和转录组在肠道各节段的变化与代谢综合征猪模型中的肥胖有关。

Microbiol Spectr. 2022 Aug 31;10(4):e0071722. doi: 10.1128/spectrum.00717-22. Epub 2022 Jul 7.

LinDA: linear models for differential abundance analysis of microbiome compositional data.LinDA：用于微生物组组成数据差异丰度分析的线性模型

Genome Biol. 2022 Apr 14;23(1):95. doi: 10.1186/s13059-022-02655-5.

Zero-inflated Poisson models with measurement error in the response.带有响应测量误差的零膨胀泊松模型。

Biometrics. 2023 Jun;79(2):1089-1102. doi: 10.1111/biom.13657. Epub 2022 Apr 20.

fastANCOM: a fast method for analysis of compositions of microbiomes.fastANCOM：一种用于微生物群落组成分析的快速方法。

Bioinformatics. 2022 Mar 28;38(7):2039-2041. doi: 10.1093/bioinformatics/btac060.

MZINBVA: variational approximation for multilevel zero-inflated negative-binomial models for association analysis in microbiome surveys.MZINBVA：用于宏基因组调查中关联分析的多水平零膨胀负二项式模型的变分逼近。

Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab443.

Benchmarking microbiome transformations favors experimental quantitative approaches to address compositionality and sampling depth biases.基准测试微生物组转化有利于采用实验定量方法来解决组成和采样深度偏差问题。

Nat Commun. 2021 Jun 11;12(1):3562. doi: 10.1038/s41467-021-23821-6.

The Fecal Microbiota Is Already Altered in Normoglycemic Individuals Who Go on to Have Type 2 Diabetes.在后来发展为 2 型糖尿病的血糖正常个体中，粪便微生物群已经发生改变。

Front Cell Infect Microbiol. 2021 Feb 18;11:598672. doi: 10.3389/fcimb.2021.598672. eCollection 2021.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

mbDecoda：一种用于微生物组调查的成分数据分析的偏差校正方法。

mbDecoda: a debiased approach to compositional data analysis for microbiome surveys.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献