Rank normalization empowers a t-test for microbiome differential abundance analysis while controlling for false discoveries.

Author affiliations

Department of Biostatistics, University of Iowa College of Public Health, 145 N Riverside Dr, 52242, IA, USA.

Department of Biostatistics, Yale School of Public Health, 60 College St, 06510, CT, USA.

Publication information

Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab059.

DOI: 10.1093/bib/bbab059

PMID: 33822893

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9630402/

Abstract

A major task in the analysis of microbiome data is to identify microbes associated with differing biological conditions. Before conducting analysis, raw data must first be adjusted so that counts from different samples are comparable. A typical approach is to estimate normalization factors by which all counts in a sample are multiplied or divided. However, the inherent variation associated with estimating normalization factors is often not accounted for in subsequent analysis, leading to a loss of precision. Rank normalization is a nonparametric alternative to the estimation of normalization factors in which each count for a microbial feature is replaced by its intrasample rank. Although rank normalization has been successfully applied to microarray analysis in the past, it has yet to be explored for microbiome data, which is characterized by high frequencies of zeros, strongly correlated features and compositionality. We propose to use rank normalization as an alternative to the estimation of normalization factors and examine its performance when paired with a two-sample t-test. On a rigorous third-party benchmarking simulation, it is shown to offer strong control over the false discovery rate and, at sample sizes greater than 50 per treatment group, to offer an improvement in performance over commonly used normalization factors paired with t-tests, Wilcoxon rank-sum tests and methodologies implemented by R packages. On two real datasets, it yielded valid and reproducible results that were strongly in agreement with the original findings and the existing literature, further demonstrating its robustness and future potential. Availability: The data underlying this article are available online, along with R code and supplementary materials, at https://github.com/matthewlouisdavisBioStat/Rank-Normalization-Empowers-a-T-Test.
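The core idea in the abstract, replacing each count by its within-sample rank and then comparing the ranks of each feature between groups with a two-sample t-test, can be illustrated with a minimal sketch. This is not the authors' R implementation: the function names, the toy counts, the use of average ranks for ties, and the choice of the unequal-variance (Welch) form of the t statistic are all assumptions made for illustration. P-values and false discovery rate adjustment are omitted.

```python
from math import sqrt
from statistics import mean, variance


def rank_normalize(sample_counts):
    """Replace each count in one sample by its within-sample rank.

    Tied counts (for example the many zeros) share their average rank.
    The tie rule is an assumption of this sketch.
    """
    order = sorted(range(len(sample_counts)), key=lambda i: sample_counts[i])
    ranks = [0.0] * len(sample_counts)
    start = 0
    while start < len(order):
        end = start
        # Extend the block while the next count ties with the current one.
        while (end + 1 < len(order)
               and sample_counts[order[end + 1]] == sample_counts[order[start]]):
            end += 1
        average_rank = (start + end) / 2 + 1  # 1-based average rank of the block
        for position in range(start, end + 1):
            ranks[order[position]] = average_rank
        start = end + 1
    return ranks


def welch_t(group_a, group_b):
    """Two-sample t statistic with unequal variances (Welch form)."""
    standard_error = sqrt(variance(group_a) / len(group_a)
                          + variance(group_b) / len(group_b))
    return (mean(group_a) - mean(group_b)) / standard_error


# Hypothetical toy data: rows are samples, columns are microbial features.
control = [[0, 5, 12, 0], [1, 7, 9, 0], [0, 4, 15, 2]]
treated = [[9, 3, 11, 0], [14, 0, 8, 1], [7, 2, 10, 0]]

# Normalize each sample on its own; no normalization factor is estimated.
ranked_control = [rank_normalize(sample) for sample in control]
ranked_treated = [rank_normalize(sample) for sample in treated]

# Test each feature by comparing its ranks between the two groups.
t_statistics = [
    welch_t([sample[j] for sample in ranked_control],
            [sample[j] for sample in ranked_treated])
    for j in range(len(control[0]))
]
print(t_statistics)
```

Because ranks are computed separately within each sample, differences in sequencing depth between samples drop out without any estimated scaling factor. A full analysis would convert each statistic to a p-value and apply a multiple-testing correction across features, and would need to handle features whose ranks are constant in both groups, which this sketch does not.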
