Suppr超能文献

一种用于识别响应变量与细菌群落组成之间关联的贝叶斯方法。

A Bayesian method for identifying associations between response variables and bacterial community composition.

机构信息

Bureau of Food Surveillance and Science Integration, Food Directorate, Health Products and Food Branch, Health Canada, Ottawa, Canada.

Bureau of Nutritional Sciences, Food Directorate, Health Products and Food Branch, Health Canada, Ottawa, Canada.

出版信息

PLoS Comput Biol. 2022 Jul 6;18(7):e1010108. doi: 10.1371/journal.pcbi.1010108. eCollection 2022 Jul.

Abstract

Determining associations between intestinal bacteria and continuously measured physiological outcomes is important for understanding the bacteria-host relationship but is not straightforward since abundance data (compositional data) are not normally distributed. To address this issue, we developed a fully Bayesian linear regression model (BRACoD; Bayesian Regression Analysis of Compositional Data) with physiological measurements (continuous data) as a function of a matrix of relative bacterial abundances. Bacteria can be classified as operational taxonomic units or by taxonomy (genus, family, etc.). Bacteria associated with the physiological measurement were identified using a Bayesian variable selection method: Stochastic Search Variable Selection. The output is a list of inclusion probabilities ([Formula: see text]) and coefficients that indicate the strength of the association ([Formula: see text]) for each bacterial taxa. Tests with simulated communities showed that adopting a cut point value of [Formula: see text] ≥ 0.3 for identifying included bacteria optimized the true positive rate (TPR) while maintaining a false positive rate (FPR) of ≤ 5%. At this point, the chances of identifying non-contributing bacteria were low and all well-established contributors were included. Comparison with other methods showed that BRACoD (at [Formula: see text] ≥ 0.3) had higher precision and a higher TPR than a commonly used center log transformed LASSO procedure (clr-LASSO) as well as higher TPR than an off-the-shelf Spike and Slab method after center log transformation (clr-SS). BRACoD was also less likely to include non-contributing bacteria that merely correlate with contributing bacteria. Analysis of a rat microbiome experiment identified 47 operational taxonomic units that contributed to fecal butyrate levels. Of these, 31 were positively and 16 negatively associated with butyrate. Consistent with their known role in butyrate metabolism, most of these fell within the Lachnospiraceae and Ruminococcaceae. We conclude that BRACoD provides a more precise and accurate method for determining bacteria associated with a continuous physiological outcome compared to clr-LASSO. It is more sensitive than a generalized clr-SS algorithm, although it has a higher FPR. Its ability to distinguish genuine contributors from correlated bacteria makes it better suited to discriminating bacteria that directly contribute to an outcome. The algorithm corrects for the distortions arising from compositional data making it appropriate for analysis of microbiome data.

摘要

确定肠道细菌与连续测量的生理结果之间的关联对于了解细菌与宿主的关系非常重要,但由于丰度数据(组成数据)不是正态分布的,因此并不简单。为了解决这个问题,我们开发了一种完全贝叶斯线性回归模型(BRACoD;贝叶斯组成数据分析的回归分析),将生理测量(连续数据)作为相对细菌丰度矩阵的函数。细菌可以按操作分类单位或分类(属、科等)进行分类。使用贝叶斯变量选择方法(Stochastic Search Variable Selection)识别与生理测量相关的细菌。输出是一个包含概率的列表([Formula: see text])和指示细菌分类群关联强度的系数([Formula: see text])。用模拟群落进行的测试表明,采用[Formula: see text]≥0.3 的截点值来识别包含的细菌可以优化真阳性率(TPR),同时保持假阳性率(FPR)≤5%。在这一点上,识别非贡献细菌的可能性很低,并且所有既定的贡献者都包括在内。与其他方法的比较表明,BRACoD(在[Formula: see text]≥0.3 时)的精度高于常用的中心对数变换 LASSO 过程(clr-LASSO),并且在中心对数变换后的现成 Spike and Slab 方法(clr-SS)的 TPR 也更高。BRACoD 也不太可能包含仅与贡献细菌相关的非贡献细菌。对大鼠微生物组实验的分析确定了 47 个操作分类单位,这些分类单位与粪便丁酸水平有关。其中,31 个与丁酸呈正相关,16 个与丁酸呈负相关。与它们在丁酸代谢中的已知作用一致,其中大多数属于lachnospiraceae 和 ruminococcaceae。我们得出的结论是,与 clr-LASSO 相比,BRACoD 为确定与连续生理结果相关的细菌提供了一种更精确和准确的方法。它比广义的 clr-SS 算法更敏感,尽管它的 FPR 更高。它能够区分真正的贡献者和相关的细菌,使其更适合区分直接导致结果的细菌。该算法纠正了组成数据引起的扭曲,使其适合分析微生物组数据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32dc/9307184/6406c9dcea83/pcbi.1010108.g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验