微生物组的成分数据分析：基础、工具与挑战

Compositional data analysis of the microbiome: fundamentals, tools, and challenges.

作者信息

Tsilimigras Matthew C B, Fodor Anthony A

机构信息

Department of Bioinformatics and Genomics, UNC Charlotte, Bioinformatics Building, The University of North Carolina, Charlotte 9201, University City Blvd, Charlotte.

出版信息

Ann Epidemiol. 2016 May;26(5):330-5. doi: 10.1016/j.annepidem.2016.03.002. Epub 2016 Mar 31.

DOI:10.1016/j.annepidem.2016.03.002

PMID:27255738

Abstract

PURPOSE

Human microbiome studies are within the realm of compositional data with the absolute abundances of microbes not recoverable from sequence data alone. In compositional data analysis, each sample consists of proportions of various organisms with a sum constrained to a constant. This simple feature can lead traditional statistical treatments when naively applied to produce errant results and spurious correlations.

METHODS

We review the origins of compositionality in microbiome data, the theory and usage of compositional data analysis in this setting and some recent attempts at solutions to these problems.

RESULTS

Microbiome sequence data sets are typically high dimensional, with the number of taxa much greater than the number of samples, and sparse as most taxa are only observed in a small number of samples. These features of microbiome sequence data interact with compositionality to produce additional challenges in analysis.

CONCLUSIONS

Despite sophisticated approaches to statistical transformation, the analysis of compositional data may remain a partially intractable problem, limiting inference. We suggest that current research needs include better generation of simulated data and further study of how the severity of compositional effects changes when sampling microbial communities of widely differing diversity.

摘要

目的

人类微生物组研究属于成分数据范畴，仅从序列数据无法获取微生物的绝对丰度。在成分数据分析中，每个样本由各种生物体的比例组成，其总和被限制为一个常数。当简单地应用传统统计方法时，这一简单特征可能会导致错误的结果和虚假的相关性。

方法

我们回顾了微生物组数据中成分性的起源、在这种情况下成分数据分析的理论和应用，以及最近一些针对这些问题的解决方案尝试。

结果

微生物组序列数据集通常是高维的，分类单元的数量远大于样本数量，并且很稀疏，因为大多数分类单元仅在少数样本中被观察到。微生物组序列数据的这些特征与成分性相互作用，在分析中产生了额外的挑战。

结论

尽管有复杂的统计转换方法，但成分数据的分析可能仍然是一个部分难以解决的问题，限制了推断。我们建议当前的研究需求包括更好地生成模拟数据，以及进一步研究在对多样性差异很大的微生物群落进行采样时，成分效应的严重程度如何变化。

相似文献

Compositional data analysis of the microbiome: fundamentals, tools, and challenges.

Ann Epidemiol. 2016 May;26(5):330-5. doi: 10.1016/j.annepidem.2016.03.002. Epub 2016 Mar 31.

CCLasso: correlation inference for compositional data through Lasso.

Bioinformatics. 2015 Oct 1;31(19):3172-80. doi: 10.1093/bioinformatics/btv349. Epub 2015 Jun 4.

It's all relative: analyzing microbiome data as compositions.

Ann Epidemiol. 2016 May;26(5):322-9. doi: 10.1016/j.annepidem.2016.03.003. Epub 2016 Apr 2.

Analysis and correction of compositional bias in sparse sequencing count data.

BMC Genomics. 2018 Nov 6;19(1):799. doi: 10.1186/s12864-018-5160-5.

The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies.

BMC Microbiol. 2015 Mar 21;15:66. doi: 10.1186/s12866-015-0351-6.

Experimental metagenomics and ribosomal profiling of the human skin microbiome.

Exp Dermatol. 2017 Mar;26(3):211-219. doi: 10.1111/exd.13210. Epub 2017 Jan 20.

MixMC: A Multivariate Statistical Framework to Gain Insight into Microbial Communities.

PLoS One. 2016 Aug 11;11(8):e0160169. doi: 10.1371/journal.pone.0160169. eCollection 2016.

Sparse and compositionally robust inference of microbial ecological networks.

PLoS Comput Biol. 2015 May 7;11(5):e1004226. doi: 10.1371/journal.pcbi.1004226. eCollection 2015 May.

Large-scale benchmarking reveals false discoveries and count transformation sensitivity in 16S rRNA gene amplicon data analysis methods used in microbiome studies.

Microbiome. 2016 Nov 25;4(1):62. doi: 10.1186/s40168-016-0208-8.

Inference of Environmental Factor-Microbe and Microbe-Microbe Associations from Metagenomic Data Using a Hierarchical Bayesian Statistical Model.

Cell Syst. 2017 Jan 25;4(1):129-137.e5. doi: 10.1016/j.cels.2016.12.012.

引用本文的文献

SpeSpeNet: an interactive and user-friendly tool to create and explore microbial correlation networks.

ISME Commun. 2025 Feb 24;5(1):ycaf036. doi: 10.1093/ismeco/ycaf036. eCollection 2025 Jan.

Group-wise normalization in differential abundance analysis of microbiome samples.

BMC Bioinformatics. 2025 Jul 29;26(1):196. doi: 10.1186/s12859-025-06235-9.

Twenty-Four-Hour Compositional Data Analysis in Healthcare: Clinical Potential and Future Directions.

Int J Environ Res Public Health. 2025 Jun 25;22(7):1002. doi: 10.3390/ijerph22071002.

Absolute abundance unveils Basidiobolus as a cross-domain bridge indirectly bolstering gut microbiome homeostasis.

ISME J. 2025 Jan 2;19(1). doi: 10.1093/ismejo/wraf150.

The Impact of the Skin Microbiome and Oxidative Stress on the Initiation and Development of Cutaneous Chronic Wounds.

Antioxidants (Basel). 2025 Jun 4;14(6):682. doi: 10.3390/antiox14060682.

Comparative Community Ecology Reveals Conserved Ectoparasite Microbiomes Amidst Variable Host and Environment Microbiomes.

Ecol Evol. 2025 Apr 2;15(4):e71120. doi: 10.1002/ece3.71120. eCollection 2025 Apr.

Human reference microbiome profiles of different body habitats in healthy individuals.

Front Cell Infect Microbiol. 2025 Feb 11;15:1478136. doi: 10.3389/fcimb.2025.1478136. eCollection 2025.

Group-wise normalization in differential abundance analysis of microbiome samples.

ArXiv. 2024 Nov 23:arXiv:2411.15400v1.

The quest for environmental analytical microbiology: absolute quantitative microbiome using cellular internal standards.

Microbiome. 2025 Jan 27;13(1):26. doi: 10.1186/s40168-024-02009-2.

Siderophore synthetase-receptor gene coevolution reveals habitat- and pathogen-specific bacterial iron interaction networks.

Sci Adv. 2025 Jan 17;11(3):eadq5038. doi: 10.1126/sciadv.adq5038. Epub 2025 Jan 15.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

微生物组的成分数据分析：基础、工具与挑战

Compositional data analysis of the microbiome: fundamentals, tools, and challenges.

作者信息

机构信息

出版信息

PURPOSE

METHODS

RESULTS

CONCLUSIONS

目的

方法

结果

结论

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献