• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

收缩法可改善不同标准化方法下微生物关联的估计。

Shrinkage improves estimation of microbial associations under different normalization methods.

作者信息

Badri Michelle, Kurtz Zachary D, Bonneau Richard, Müller Christian L

机构信息

Department of Biology, New York University, New York, NY 10012, USA.

Lodo Therapeutics, New York, NY 10016, USA.

出版信息

NAR Genom Bioinform. 2020 Dec 17;2(4):lqaa100. doi: 10.1093/nargab/lqaa100. eCollection 2020 Dec.

DOI:10.1093/nargab/lqaa100
PMID:33575644
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7745771/
Abstract

Estimation of statistical associations in microbial genomic survey count data is fundamental to microbiome research. Experimental limitations, including count compositionality, low sample sizes and technical variability, obstruct standard application of association measures and require data normalization prior to statistical estimation. Here, we investigate the interplay between data normalization, microbial association estimation and available sample size by leveraging the large-scale American Gut Project (AGP) survey data. We analyze the statistical properties of two prominent linear association estimators, correlation and proportionality, under different sample scenarios and data normalization schemes, including RNA-seq analysis workflows and log-ratio transformations. We show that shrinkage estimation, a standard statistical regularization technique, can universally improve the quality of taxon-taxon association estimates for microbiome data. We find that large-scale association patterns in the AGP data can be grouped into five normalization-dependent classes. Using microbial association network construction and clustering as downstream data analysis examples, we show that variance-stabilizing and log-ratio approaches enable the most taxonomically and structurally coherent estimates. Taken together, the findings from our reproducible analysis workflow have important implications for microbiome studies in multiple stages of analysis, particularly when only small sample sizes are available.

摘要

在微生物基因组调查计数数据中估计统计关联是微生物组研究的基础。实验限制,包括计数的组成性、小样本量和技术变异性,阻碍了关联度量的标准应用,并且在进行统计估计之前需要对数据进行归一化处理。在此,我们通过利用大规模的美国肠道项目(AGP)调查数据,研究数据归一化、微生物关联估计和可用样本量之间的相互作用。我们分析了两种著名的线性关联估计器(相关性和比例性)在不同样本场景和数据归一化方案下的统计特性,包括RNA测序分析工作流程和对数比变换。我们表明,收缩估计作为一种标准的统计正则化技术,可以普遍提高微生物组数据中分类单元-分类单元关联估计的质量。我们发现,AGP数据中的大规模关联模式可以分为五个依赖于归一化的类别。以微生物关联网络构建和聚类作为下游数据分析示例,我们表明方差稳定和对数比方法能够实现最具分类学和结构一致性的估计。总之,我们可重复分析工作流程的研究结果对微生物组研究多个分析阶段具有重要意义,特别是在只有小样本量可用的情况下。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5917/7745771/81ef015d7a6c/lqaa100fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5917/7745771/4098604d6986/lqaa100fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5917/7745771/ca7a39765f67/lqaa100fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5917/7745771/59eda8bde731/lqaa100fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5917/7745771/361fc3d2f18c/lqaa100fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5917/7745771/4919e9f2bbfd/lqaa100fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5917/7745771/76624c1988ca/lqaa100fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5917/7745771/fe0369e2e0da/lqaa100fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5917/7745771/81ef015d7a6c/lqaa100fig8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5917/7745771/4098604d6986/lqaa100fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5917/7745771/ca7a39765f67/lqaa100fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5917/7745771/59eda8bde731/lqaa100fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5917/7745771/361fc3d2f18c/lqaa100fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5917/7745771/4919e9f2bbfd/lqaa100fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5917/7745771/76624c1988ca/lqaa100fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5917/7745771/fe0369e2e0da/lqaa100fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5917/7745771/81ef015d7a6c/lqaa100fig8.jpg

相似文献

1
Shrinkage improves estimation of microbial associations under different normalization methods.收缩法可改善不同标准化方法下微生物关联的估计。
NAR Genom Bioinform. 2020 Dec 17;2(4):lqaa100. doi: 10.1093/nargab/lqaa100. eCollection 2020 Dec.
2
Microbial Networks in SPRING - Semi-parametric Rank-Based Correlation and Partial Correlation Estimation for Quantitative Microbiome Data.SPRING中的微生物网络——用于定量微生物组数据的基于半参数秩的相关性和偏相关性估计
Front Genet. 2019 Jun 6;10:516. doi: 10.3389/fgene.2019.00516. eCollection 2019.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
NetCoMi: network construction and comparison for microbiome data in R.NetCoMi:用于微生物组数据的网络构建和比较的 R 包。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa290.
5
Rank normalization empowers a t-test for microbiome differential abundance analysis while controlling for false discoveries.秩归一化在控制假发现率的同时,增强了微生物组差异丰度分析的 t 检验。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab059.
6
Joint Adaptive Mean-Variance Regularization and Variance Stabilization of High Dimensional Data.高维数据的联合自适应均值 - 方差正则化与方差稳定化
Comput Stat Data Anal. 2012 Jul 1;56(7):2317-2333. doi: 10.1016/j.csda.2012.01.012.
7
Benchmarking microbiome transformations favors experimental quantitative approaches to address compositionality and sampling depth biases.基准测试微生物组转化有利于采用实验定量方法来解决组成和采样深度偏差问题。
Nat Commun. 2021 Jun 11;12(1):3562. doi: 10.1038/s41467-021-23821-6.
8
Normalization and microbial differential abundance strategies depend upon data characteristics.归一化和微生物差异丰度策略取决于数据特征。
Microbiome. 2017 Mar 3;5(1):27. doi: 10.1186/s40168-017-0237-y.
9
Regularized Variance Estimation and Variance Stabilization of High Dimensional Data.高维数据的正则化方差估计与方差稳定化
Proc Am Stat Assoc. 2010 Jul-Aug;2010:5295-5309.
10
An empirical Bayes approach to normalization and differential abundance testing for microbiome data.一种针对微生物组数据的标准化和差异丰度检验的经验贝叶斯方法。
BMC Bioinformatics. 2020 Jun 3;21(1):225. doi: 10.1186/s12859-020-03552-z.

引用本文的文献

1
A GLM-based zero-inflated generalized Poisson factor model for analyzing microbiome data.一种基于广义线性模型的零膨胀广义泊松因子模型,用于分析微生物组数据。
Front Microbiol. 2024 May 30;15:1394204. doi: 10.3389/fmicb.2024.1394204. eCollection 2024.
2
Microbiome-based classification models for fresh produce safety and quality evaluation.基于微生物组的分类模型在新鲜农产品安全和质量评价中的应用。
Microbiol Spectr. 2024 Apr 2;12(4):e0344823. doi: 10.1128/spectrum.03448-23. Epub 2024 Mar 6.
3
Over-optimism in unsupervised microbiome analysis: Insights from network learning and clustering.

本文引用的文献

1
Sparse semiparametric canonical correlation analysis for data of mixed types.混合类型数据的稀疏半参数典型相关分析
Biometrika. 2020 Sep;107(3):609-625. doi: 10.1093/biomet/asaa007. Epub 2020 Apr 15.
2
MODELING MICROBIAL ABUNDANCES AND DYSBIOSIS WITH BETA-BINOMIAL REGRESSION.使用贝塔二项式回归对微生物丰度和生态失调进行建模。
Ann Appl Stat. 2020 Mar;14(1):94-115. doi: 10.1214/19-aoas1283. Epub 2020 Apr 16.
3
Estimating diversity in networked ecological communities.估计网络生态群落中的多样性。
无监督微生物组分析中的过度乐观:来自网络学习和聚类的见解。
PLoS Comput Biol. 2023 Jan 6;19(1):e1010820. doi: 10.1371/journal.pcbi.1010820. eCollection 2023 Jan.
4
Poisson hurdle model-based method for clustering microbiome features.基于泊松 hurdle 模型的微生物组特征聚类方法。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac782.
5
How much does the typical ecological meta-analysis overestimate the true mean effect size?典型的生态元分析会高估真实平均效应量多少?
Ecol Evol. 2022 Nov 15;12(11):e9521. doi: 10.1002/ece3.9521. eCollection 2022 Nov.
6
Is There a Universal Endurance Microbiota?是否存在普遍的耐力微生物群?
Microorganisms. 2022 Nov 9;10(11):2213. doi: 10.3390/microorganisms10112213.
7
Metagenomic study of the gut microbiota associated with cow milk consumption in Chinese peri-/postmenopausal women.中国围绝经期/绝经后女性饮用牛奶相关肠道微生物群的宏基因组学研究。
Front Microbiol. 2022 Aug 16;13:957885. doi: 10.3389/fmicb.2022.957885. eCollection 2022.
8
Negative binomial factor regression with application to microbiome data analysis.负二项因子回归及其在微生物组数据分析中的应用。
Stat Med. 2022 Jul 10;41(15):2786-2803. doi: 10.1002/sim.9384. Epub 2022 Apr 24.
9
Adaptive and powerful microbiome multivariate association analysis via feature selection.通过特征选择实现适应性强且强大的微生物组多变量关联分析。
NAR Genom Bioinform. 2022 Jan 14;4(1):lqab120. doi: 10.1093/nargab/lqab120. eCollection 2022 Mar.
10
Bacterial low-abundant taxa are key determinants of a healthy airway metagenome in the early years of human life.细菌低丰度分类群是人类生命早期健康气道宏基因组的关键决定因素。
Comput Struct Biotechnol J. 2021 Dec 15;20:175-186. doi: 10.1016/j.csbj.2021.12.008. eCollection 2022.
Biostatistics. 2022 Jan 13;23(1):207-222. doi: 10.1093/biostatistics/kxaa015.
4
Treatment-Specific Composition of the Gut Microbiota Is Associated With Disease Remission in a Pediatric Crohn's Disease Cohort.治疗特异性肠道微生物群组成与小儿克罗恩病队列的疾病缓解相关。
Inflamm Bowel Dis. 2019 Nov 14;25(12):1927-1938. doi: 10.1093/ibd/izz130.
5
Microbial Networks in SPRING - Semi-parametric Rank-Based Correlation and Partial Correlation Estimation for Quantitative Microbiome Data.SPRING中的微生物网络——用于定量微生物组数据的基于半参数秩的相关性和偏相关性估计
Front Genet. 2019 Jun 6;10:516. doi: 10.3389/fgene.2019.00516. eCollection 2019.
6
Analysis and correction of compositional bias in sparse sequencing count data.稀疏测序计数数据中组成偏差的分析与校正。
BMC Genomics. 2018 Nov 6;19(1):799. doi: 10.1186/s12864-018-5160-5.
7
American Gut: an Open Platform for Citizen Science Microbiome Research.美国肠道计划:一个用于公民科学微生物组研究的开放平台。
mSystems. 2018 May 15;3(3). doi: 10.1128/mSystems.00031-18. eCollection 2018 May-Jun.
8
Meta-analysis of gut microbiome studies identifies disease-specific and shared responses.基于宏基因组关联研究的肠道微生物组分析鉴定出疾病特异性和共享反应。
Nat Commun. 2017 Dec 5;8(1):1784. doi: 10.1038/s41467-017-01973-8.
9
Microbiome Datasets Are Compositional: And This Is Not Optional.微生物组数据集具有构成性:这并非可有可无。
Front Microbiol. 2017 Nov 15;8:2224. doi: 10.3389/fmicb.2017.02224. eCollection 2017.
10
propr: An R-package for Identifying Proportionally Abundant Features Using Compositional Data Analysis.propr:一个使用成分数据分析识别比例丰富特征的 R 包。
Sci Rep. 2017 Nov 24;7(1):16252. doi: 10.1038/s41598-017-16520-0.