• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

违背诺言:微生物组差异丰度方法无法控制假发现率。

A broken promise: microbiome differential abundance methods do not control the false discovery rate.

机构信息

Department of Mathematical Modelling, Statistics and Bioinformatics at Ghent University, Belgium.

Center for Statistics at Hasselt University, Belgium.

出版信息

Brief Bioinform. 2019 Jan 18;20(1):210-221. doi: 10.1093/bib/bbx104.

DOI:10.1093/bib/bbx104
PMID:28968702
Abstract

High-throughput sequencing technologies allow easy characterization of the human microbiome, but the statistical methods to analyze microbiome data are still in their infancy. Differential abundance methods aim at detecting associations between the abundances of bacterial species and subject grouping factors. The results of such methods are important to identify the microbiome as a prognostic or diagnostic biomarker or to demonstrate efficacy of prodrug or antibiotic drugs. Because of a lack of benchmarking studies in the microbiome field, no consensus exists on the performance of the statistical methods. We have compared a large number of popular methods through extensive parametric and nonparametric simulation as well as real data shuffling algorithms. The results are consistent over the different approaches and all point to an alarming excess of false discoveries. This raises great doubts about the reliability of discoveries in past studies and imperils reproducibility of microbiome experiments. To further improve method benchmarking, we introduce a new simulation tool that allows to generate correlated count data following any univariate count distribution; the correlation structure may be inferred from real data. Most simulation studies discard the correlation between species, but our results indicate that this correlation can negatively affect the performance of statistical methods.

摘要

高通量测序技术使得人类微生物组的特征变得容易,但分析微生物组数据的统计方法仍处于起步阶段。差异丰度方法旨在检测细菌物种丰度与主体分组因素之间的关联。这些方法的结果对于确定微生物组作为预后或诊断生物标志物,或证明前药或抗生素药物的疗效非常重要。由于缺乏微生物组领域的基准研究,因此在统计方法的性能方面没有共识。我们通过广泛的参数和非参数模拟以及真实数据随机化算法比较了大量流行的方法。不同方法的结果是一致的,并且都指向假发现的惊人过剩。这对过去研究中发现的可靠性提出了很大的质疑,并危及微生物组实验的可重复性。为了进一步改进方法基准测试,我们引入了一种新的模拟工具,该工具可以根据任何单变量计数分布生成相关的计数数据;相关结构可以从实际数据中推断出来。大多数模拟研究忽略了物种之间的相关性,但我们的结果表明,这种相关性可能会对统计方法的性能产生负面影响。

相似文献

1
A broken promise: microbiome differential abundance methods do not control the false discovery rate.违背诺言:微生物组差异丰度方法无法控制假发现率。
Brief Bioinform. 2019 Jan 18;20(1):210-221. doi: 10.1093/bib/bbx104.
2
Large-scale benchmarking reveals false discoveries and count transformation sensitivity in 16S rRNA gene amplicon data analysis methods used in microbiome studies.大规模基准测试揭示了微生物组研究中使用的 16S rRNA 基因扩增子数据分析方法中的假发现和计数转换敏感性。
Microbiome. 2016 Nov 25;4(1):62. doi: 10.1186/s40168-016-0208-8.
3
MIDASim: a fast and simple simulator for realistic microbiome data.MIDASim:一个快速而简单的用于真实微生物组数据模拟的工具。
Microbiome. 2024 Jul 22;12(1):135. doi: 10.1186/s40168-024-01822-z.
4
Correcting for batch effects in case-control microbiome studies.病例对照微生物组研究中批次效应的校正。
PLoS Comput Biol. 2018 Apr 23;14(4):e1006102. doi: 10.1371/journal.pcbi.1006102. eCollection 2018 Apr.
5
metamicrobiomeR: an R package for analysis of microbiome relative abundance data using zero-inflated beta GAMLSS and meta-analysis across studies using random effects models.metamicrobiomeR:一个用于分析微生物组相对丰度数据的 R 包,使用零膨胀 beta GAMLSS 和随机效应模型进行跨研究的荟萃分析。
BMC Bioinformatics. 2019 Apr 16;20(1):188. doi: 10.1186/s12859-019-2744-2.
6
A comparison of sequencing platforms and bioinformatics pipelines for compositional analysis of the gut microbiome.用于肠道微生物组组成分析的测序平台和生物信息学管道的比较。
BMC Microbiol. 2017 Sep 13;17(1):194. doi: 10.1186/s12866-017-1101-8.
7
A comprehensive evaluation of microbial differential abundance analysis methods: current status and potential solutions.微生物差异丰度分析方法的综合评估:现状与潜在解决方案。
Microbiome. 2022 Aug 19;10(1):130. doi: 10.1186/s40168-022-01320-0.
8
A realistic benchmark for differential abundance testing and confounder adjustment in human microbiome studies.用于人类微生物组研究中差异丰度检验和混杂因素调整的现实基准。
Genome Biol. 2024 Sep 25;25(1):247. doi: 10.1186/s13059-024-03390-9.
9
fastCCLasso: a fast and efficient algorithm for estimating correlation matrix from compositional data.fastCCLasso:一种从成分数据中估计相关矩阵的快速高效算法。
Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae314.
10
Correlation and association analyses in microbiome study integrating multiomics in health and disease.在健康和疾病的多组学整合微生物组研究中进行相关性和关联性分析。
Prog Mol Biol Transl Sci. 2020;171:309-491. doi: 10.1016/bs.pmbts.2020.04.003. Epub 2020 May 23.

引用本文的文献

1
Assessment of absolute abundance in mother-infant gut microbiome using marine-sourced bacterial DNA spike-in and comparison with conventional quantification methods.使用海洋来源的细菌DNA内参评估母婴肠道微生物群的绝对丰度,并与传统定量方法进行比较。
Microbiome Res Rep. 2025 Jun 9;4(2):23. doi: 10.20517/mrr.2024.94. eCollection 2025.
2
Explicit Scale Simulation for analysis of RNA-sequencing count data with ALDEx2.使用ALDEx2对RNA测序计数数据进行分析的显式尺度模拟。
NAR Genom Bioinform. 2025 Aug 19;7(3):lqaf108. doi: 10.1093/nargab/lqaf108. eCollection 2025 Sep.
3
A workflow for statistical analysis and visualization of microbiome omics data using the R microeco package.
一种使用R语言的microeco软件包对微生物组组学数据进行统计分析和可视化的工作流程。
Nat Protoc. 2025 Aug 6. doi: 10.1038/s41596-025-01239-4.
4
A systematic benchmark of integrative strategies for microbiome-metabolome data.微生物组-代谢组数据整合策略的系统基准测试
Commun Biol. 2025 Jul 25;8(1):1100. doi: 10.1038/s42003-025-08515-9.
5
metaGEENOME: an integrated framework for differential abundance analysis of microbiome data in cross-sectional and longitudinal studies.metaGEENOME:一个用于横断面和纵向研究中微生物组数据差异丰度分析的综合框架。
BMC Bioinformatics. 2025 Jul 21;26(1):189. doi: 10.1186/s12859-025-06217-x.
6
Composition-on-composition regression analysis for multi-omics integration of metagenomic data.宏基因组数据多组学整合的组合对组合回归分析
Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf387.
7
Incorporating scale uncertainty in microbiome and gene expression analysis as an extension of normalization.将尺度不确定性纳入微生物组和基因表达分析,作为归一化的扩展。
Genome Biol. 2025 May 22;26(1):139. doi: 10.1186/s13059-025-03609-3.
8
A semiparametric quantile regression rank score test for zero-inflated data.用于零膨胀数据的半参数分位数回归秩得分检验。
Biometrics. 2025 Apr 2;81(2). doi: 10.1093/biomtc/ujaf050.
9
Postlarval Shrimp-Associated Microbiota and Underlying Ecological Processes over AHPND Progression.幼虾后阶段与虾类急性肝胰腺坏死病(AHPND)病程相关的微生物群及其潜在生态过程
Microorganisms. 2025 Mar 24;13(4):720. doi: 10.3390/microorganisms13040720.
10
Identity and timing of protist inoculation affect plant performance largely irrespective of changes in the rhizosphere microbial community.原生生物接种的身份和时间在很大程度上影响植物的生长,而与根际微生物群落的变化无关。
Appl Environ Microbiol. 2025 Apr 23;91(4):e0024025. doi: 10.1128/aem.00240-25. Epub 2025 Mar 31.