Suppr超能文献

MaAsLin 3:改进和扩展用于宏基因组关联发现的广义多变量线性模型。

MaAsLin 3: Refining and extending generalized multivariable linear models for meta-omic association discovery.

作者信息

Nickols William A, Kuntz Thomas, Shen Jiaxian, Maharjan Sagun, Mallick Himel, Franzosa Eric A, Thompson Kelsey N, Nearing Jacob T, Huttenhower Curtis

机构信息

Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA.

Harvard Chan Microbiome in Public Health Center, Harvard T. H. Chan School of Public Health, Boston, MA, USA.

出版信息

bioRxiv. 2024 Dec 14:2024.12.13.628459. doi: 10.1101/2024.12.13.628459.

Abstract

A key question in microbial community analysis is determining which microbial features are associated with community properties such as environmental or health phenotypes. This statistical task is impeded by characteristics of typical microbial community profiling technologies, including sparsity (which can be either technical or biological) and the compositionality imposed by most nucleotide sequencing approaches. Many models have been proposed that focus on how the relative abundance of a feature (e.g. taxon or pathway) relates to one or more covariates. Few of these, however, simultaneously control false discovery rates, achieve reasonable power, incorporate complex modeling terms such as random effects, and also permit assessment of prevalence (presence/absence) associations and absolute abundance associations (when appropriate measurements are available, e.g. qPCR or spike-ins). Here, we introduce MaAsLin 3 (Microbiome Multivariable Associations with Linear Models), a modeling framework that simultaneously identifies both abundance and prevalence relationships in microbiome studies with modern, potentially complex designs. MaAsLin 3 also newly accounts for compositionality with experimental (spike-ins and total microbial load estimation) or computational techniques, and it expands the space of biological hypotheses that can be tested with inference for new covariate types. On a variety of synthetic and real datasets, MaAsLin 3 outperformed current state-of-the-art differential abundance methods in testing and inferring associations from compositional data. When applied to the Inflammatory Bowel Disease Multi-omics Database, MaAsLin 3 corroborated many previously reported microbial associations with the inflammatory bowel diseases, but notably 77% of associations were with feature prevalence rather than abundance. In summary, MaAsLin 3 enables researchers to identify microbiome associations with higher accuracy and more specific association types, especially in complex datasets with multiple covariates and repeated measures.

摘要

微生物群落分析中的一个关键问题是确定哪些微生物特征与群落特性相关,如环境或健康表型。典型的微生物群落分析技术的特点阻碍了这项统计任务,这些特点包括稀疏性(可能是技术上的或生物学上的)以及大多数核苷酸测序方法所带来的组成性。已经提出了许多模型,这些模型关注一个特征(如分类群或通路)的相对丰度如何与一个或多个协变量相关。然而,其中很少有模型能同时控制错误发现率、获得合理的功效、纳入诸如随机效应等复杂的建模项,并且还能允许评估患病率(存在/不存在)关联和绝对丰度关联(当有适当的测量方法时,例如定量聚合酶链反应或内参)。在这里,我们介绍MaAsLin 3(微生物组多变量线性模型关联),这是一个建模框架,它能在具有现代的、可能复杂设计的微生物组研究中同时识别丰度和患病率关系。MaAsLin 3还通过实验(内参和总微生物负荷估计)或计算技术新纳入了组成性,并且它扩展了可以通过对新协变量类型进行推断来检验的生物学假设空间。在各种合成数据集和真实数据集上,MaAsLin 3在从组成数据中测试和推断关联方面优于当前最先进的差异丰度方法。当应用于炎症性肠病多组学数据库时,MaAsLin 3证实了许多先前报道的与炎症性肠病相关的微生物关联,但值得注意的是,77%的关联是与特征患病率而非丰度相关。总之,MaAsLin 3使研究人员能够以更高的准确性和更具体的关联类型识别微生物组关联,特别是在具有多个协变量和重复测量的复杂数据集中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f9d5/11661281/d28e8a28e07d/nihpp-2024.12.13.628459v1-f0001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验