MaAsLin 3：改进和扩展用于宏基因组关联发现的广义多变量线性模型。

MaAsLin 3: Refining and extending generalized multivariable linear models for meta-omic association discovery.

作者信息

Nickols William A, Kuntz Thomas, Shen Jiaxian, Maharjan Sagun, Mallick Himel, Franzosa Eric A, Thompson Kelsey N, Nearing Jacob T, Huttenhower Curtis

机构信息

Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA.

Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA.

出版信息

bioRxiv. 2024 Dec 14:2024.12.13.628459. doi: 10.1101/2024.12.13.628459.

DOI:10.1101/2024.12.13.628459

PMID:39713460

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11661281/

Abstract

A key question in microbial community analysis is determining which microbial features are associated with community properties such as environmental or health phenotypes. This statistical task is impeded by characteristics of typical microbial community profiling technologies, including sparsity (which can be either technical or biological) and the compositionality imposed by most nucleotide sequencing approaches. Many models have been proposed that focus on how the relative abundance of a feature (e.g. taxon or pathway) relates to one or more covariates. Few of these, however, simultaneously control false discovery rates, achieve reasonable power, incorporate complex modeling terms such as random effects, and also permit assessment of prevalence (presence/absence) associations and absolute abundance associations (when appropriate measurements are available, e.g. qPCR or spike-ins). Here, we introduce MaAsLin 3 (Microbiome Multivariable Associations with Linear Models), a modeling framework that simultaneously identifies both abundance and prevalence relationships in microbiome studies with modern, potentially complex designs. MaAsLin 3 also newly accounts for compositionality with experimental (spike-ins and total microbial load estimation) or computational techniques, and it expands the space of biological hypotheses that can be tested with inference for new covariate types. On a variety of synthetic and real datasets, MaAsLin 3 outperformed current state-of-the-art differential abundance methods in testing and inferring associations from compositional data. When applied to the Inflammatory Bowel Disease Multi-omics Database, MaAsLin 3 corroborated many previously reported microbial associations with the inflammatory bowel diseases, but notably 77% of associations were with feature prevalence rather than abundance. In summary, MaAsLin 3 enables researchers to identify microbiome associations with higher accuracy and more specific association types, especially in complex datasets with multiple covariates and repeated measures.

摘要

微生物群落分析中的一个关键问题是确定哪些微生物特征与群落特性相关，如环境或健康表型。典型的微生物群落分析技术的特点阻碍了这项统计任务，这些特点包括稀疏性（可能是技术上的或生物学上的）以及大多数核苷酸测序方法所带来的组成性。已经提出了许多模型，这些模型关注一个特征（如分类群或通路）的相对丰度如何与一个或多个协变量相关。然而，其中很少有模型能同时控制错误发现率、获得合理的功效、纳入诸如随机效应等复杂的建模项，并且还能允许评估患病率（存在/不存在）关联和绝对丰度关联（当有适当的测量方法时，例如定量聚合酶链反应或内参）。在这里，我们介绍MaAsLin 3（微生物组多变量线性模型关联），这是一个建模框架，它能在具有现代的、可能复杂设计的微生物组研究中同时识别丰度和患病率关系。MaAsLin 3还通过实验（内参和总微生物负荷估计）或计算技术新纳入了组成性，并且它扩展了可以通过对新协变量类型进行推断来检验的生物学假设空间。在各种合成数据集和真实数据集上，MaAsLin 3在从组成数据中测试和推断关联方面优于当前最先进的差异丰度方法。当应用于炎症性肠病多组学数据库时，MaAsLin 3证实了许多先前报道的与炎症性肠病相关的微生物关联，但值得注意的是，77%的关联是与特征患病率而非丰度相关。总之，MaAsLin 3使研究人员能够以更高的准确性和更具体的关联类型识别微生物组关联，特别是在具有多个协变量和重复测量的复杂数据集中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f9d5/11661281/d28e8a28e07d/nihpp-2024.12.13.628459v1-f0001.jpg

相似文献

MaAsLin 3: Refining and extending generalized multivariable linear models for meta-omic association discovery.MaAsLin 3：改进和扩展用于宏基因组关联发现的广义多变量线性模型。

bioRxiv. 2024 Dec 14:2024.12.13.628459. doi: 10.1101/2024.12.13.628459.

Multivariable association discovery in population-scale meta-omics studies.基于人群的宏基因组学研究中的多变量关联发现。

PLoS Comput Biol. 2021 Nov 16;17(11):e1009442. doi: 10.1371/journal.pcbi.1009442. eCollection 2021 Nov.

BIRDMAn: A Bayesian differential abundance framework that enables robust inference of host-microbe associations.BIRDMAn：一种贝叶斯差异丰度框架，可实现对宿主-微生物关联的稳健推断。

bioRxiv. 2023 Feb 2:2023.01.30.526328. doi: 10.1101/2023.01.30.526328.

Unfolding and de-confounding: biologically meaningful causal inference from longitudinal multi-omic networks using METALICA.展开与去混淆：利用 METALICA 从纵向多组学网络中进行具有生物学意义的因果推断

mSystems. 2024 Oct 22;9(10):e0130323. doi: 10.1128/msystems.01303-23. Epub 2024 Sep 6.

Normalization and microbial differential abundance strategies depend upon data characteristics.归一化和微生物差异丰度策略取决于数据特征。

Microbiome. 2017 Mar 3;5(1):27. doi: 10.1186/s40168-017-0237-y.

MIDASim: a fast and simple simulator for realistic microbiome data.MIDASim：一个快速而简单的用于真实微生物组数据模拟的工具。

Microbiome. 2024 Jul 22;12(1):135. doi: 10.1186/s40168-024-01822-z.

Microbial Networks in SPRING - Semi-parametric Rank-Based Correlation and Partial Correlation Estimation for Quantitative Microbiome Data.SPRING中的微生物网络——用于定量微生物组数据的基于半参数秩的相关性和偏相关性估计

Front Genet. 2019 Jun 6;10:516. doi: 10.3389/fgene.2019.00516. eCollection 2019.

MIDASim: a fast and simple simulator for realistic microbiome data.MIDASim：一款用于逼真微生物组数据的快速简易模拟器。

bioRxiv. 2024 Mar 27:2023.03.23.533996. doi: 10.1101/2023.03.23.533996.

Correlation and association analyses in microbiome study integrating multiomics in health and disease.在健康和疾病的多组学整合微生物组研究中进行相关性和关联性分析。

Prog Mol Biol Transl Sci. 2020;171:309-491. doi: 10.1016/bs.pmbts.2020.04.003. Epub 2020 May 23.

Poly-omic risk scores predict inflammatory bowel disease diagnosis.多组学风险评分可预测炎症性肠病的诊断。

mSystems. 2024 Jan 23;9(1):e0067723. doi: 10.1128/msystems.00677-23. Epub 2023 Dec 14.

引用本文的文献

MetaDAVis: An R shiny application for metagenomic data analysis and visualization.MetaDAVis：一款用于宏基因组数据分析与可视化的R闪亮应用程序。

PLoS One. 2025 Apr 7;20(4):e0319949. doi: 10.1371/journal.pone.0319949. eCollection 2025.

本文引用的文献

Elementary methods provide more replicable results in microbial differential abundance analysis.在微生物差异丰度分析中，基本方法能提供更具可重复性的结果。

Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf130.

Coffee consumption is associated with intestinal Lawsonibacter asaccharolyticus abundance and prevalence across multiple cohorts.咖啡饮用与肠道罗伊氏乳杆菌丰度和多个队列中的流行度相关。

Nat Microbiol. 2024 Dec;9(12):3120-3134. doi: 10.1038/s41564-024-01858-9. Epub 2024 Nov 18.

A realistic benchmark for differential abundance testing and confounder adjustment in human microbiome studies.用于人类微生物组研究中差异丰度检验和混杂因素调整的现实基准。

Genome Biol. 2024 Sep 25;25(1):247. doi: 10.1186/s13059-024-03390-9.

Unique sterol metabolite shifts in inflammatory bowel disease and primary sclerosing cholangitis.炎症性肠病和原发性硬化性胆管炎中独特固醇代谢物的变化。

J Steroid Biochem Mol Biol. 2025 Jan;245:106621. doi: 10.1016/j.jsbmb.2024.106621. Epub 2024 Sep 16.

Discovery of disease-adapted bacterial lineages in inflammatory bowel diseases.炎症性肠病中疾病适应细菌谱系的发现。

Cell Host Microbe. 2024 Jul 10;32(7):1147-1162.e12. doi: 10.1016/j.chom.2024.05.022. Epub 2024 Jun 24.

Gut microbiome and metabolome profiling in Framingham heart study reveals cholesterol-metabolizing bacteria.弗雷明汉心脏研究中的肠道微生物组和代谢组分析揭示了胆固醇代谢细菌。

Cell. 2024 Apr 11;187(8):1834-1852.e19. doi: 10.1016/j.cell.2024.03.014. Epub 2024 Apr 2.

Opposing diet, microbiome, and metabolite mechanisms regulate inflammatory bowel disease in a genetically susceptible host.相反的饮食、微生物组和代谢物机制在遗传易感宿主中调节炎症性肠病。

Cell Host Microbe. 2024 Apr 10;32(4):527-542.e9. doi: 10.1016/j.chom.2024.03.001. Epub 2024 Mar 20.

Multigroup analysis of compositions of microbiomes with covariate adjustments and repeated measures.多群组分析带有协变量调整和重复测量的微生物组组成。

Nat Methods. 2024 Jan;21(1):83-91. doi: 10.1038/s41592-023-02092-7. Epub 2023 Dec 29.

Impact of metformin and Dysosmobacter welbionis on diet-induced obesity and diabetes: from clinical observation to preclinical intervention.二甲双胍和韦荣球菌对饮食诱导肥胖和糖尿病的影响：从临床观察到临床前干预。

Diabetologia. 2024 Feb;67(2):333-345. doi: 10.1007/s00125-023-06032-0. Epub 2023 Oct 28.

Simple and flexible sign and rank-based methods for testing for differential abundance in microbiome studies.用于微生物组研究中差异丰度检测的简单灵活的基于符号和秩的方法。

PLoS One. 2023 Sep 26;18(9):e0292055. doi: 10.1371/journal.pone.0292055. eCollection 2023.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

MaAsLin 3：改进和扩展用于宏基因组关联发现的广义多变量线性模型。

MaAsLin 3: Refining and extending generalized multivariable linear models for meta-omic association discovery.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献