将 16S 基因拷贝数信息纳入其中可以提高对微生物多样性和丰度的估计。

Incorporating 16S gene copy number information improves estimates of microbial diversity and abundance.

机构信息

Institute of Ecology & Evolution, University of Oregon, Eugene, Oregon, USA.

出版信息

PLoS Comput Biol. 2012;8(10):e1002743. doi: 10.1371/journal.pcbi.1002743. Epub 2012 Oct 25.

DOI:10.1371/journal.pcbi.1002743

PMID:23133348

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3486904/

Abstract

The abundance of different SSU rRNA ("16S") gene sequences in environmental samples is widely used in studies of microbial ecology as a measure of microbial community structure and diversity. However, the genomic copy number of the 16S gene varies greatly - from one in many species to up to 15 in some bacteria and to hundreds in some microbial eukaryotes. As a result of this variation the relative abundance of 16S genes in environmental samples can be attributed both to variation in the relative abundance of different organisms, and to variation in genomic 16S copy number among those organisms. Despite this fact, many studies assume that the abundance of 16S gene sequences is a surrogate measure of the relative abundance of the organisms containing those sequences. Here we present a method that uses data on sequences and genomic copy number of 16S genes along with phylogenetic placement and ancestral state estimation to estimate organismal abundances from environmental DNA sequence data. We use theory and simulations to demonstrate that 16S genomic copy number can be accurately estimated from the short reads typically obtained from high-throughput environmental sequencing of the 16S gene, and that organismal abundances in microbial communities are more strongly correlated with estimated abundances obtained from our method than with gene abundances. We re-analyze several published empirical data sets and demonstrate that the use of gene abundance versus estimated organismal abundance can lead to different inferences about community diversity and structure and the identity of the dominant taxa in microbial communities. Our approach will allow microbial ecologists to make more accurate inferences about microbial diversity and abundance based on 16S sequence data.

摘要

环境样本中不同的小亚基核糖体 RNA（“16S”）基因序列的丰度被广泛用于微生物生态学研究，作为微生物群落结构和多样性的衡量标准。然而，16S 基因的基因组拷贝数差异很大——从许多物种中的一个到某些细菌中的 15 个，再到某些微生物真核生物中的数百个。由于这种变化，环境样本中 16S 基因的相对丰度既可以归因于不同生物体相对丰度的变化，也可以归因于这些生物体中基因组 16S 拷贝数的变化。尽管如此，许多研究仍假设 16S 基因序列的丰度是包含这些序列的生物体相对丰度的替代衡量标准。在这里，我们提出了一种方法，该方法使用 16S 基因序列和基因组拷贝数的数据以及系统发育定位和祖先状态估计，从环境 DNA 序列数据中估计生物体的丰度。我们使用理论和模拟来证明，可以从高通量环境 16S 基因测序通常获得的短读序列中准确估计 16S 基因组拷贝数，并且微生物群落中生物体的丰度与我们方法获得的估计丰度的相关性比与基因丰度的相关性更强。我们重新分析了几个已发表的经验数据集，并证明了使用基因丰度与估计的生物体丰度可以导致对群落多样性和结构以及微生物群落中主要分类群的身份的不同推断。我们的方法将使微生物生态学家能够根据 16S 序列数据更准确地推断微生物多样性和丰度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad13/3486904/f44730c715ab/pcbi.1002743.g001.jpg

相似文献

Incorporating 16S gene copy number information improves estimates of microbial diversity and abundance.

PLoS Comput Biol. 2012;8(10):e1002743. doi: 10.1371/journal.pcbi.1002743. Epub 2012 Oct 25.

The variability of the 16S rRNA gene in bacterial genomes and its consequences for bacterial community analyses.

PLoS One. 2013;8(2):e57923. doi: 10.1371/journal.pone.0057923. Epub 2013 Feb 27.

Tissue-Associated Bacterial Alterations in Rectal Carcinoma Patients Revealed by 16S rRNA Community Profiling.

Front Cell Infect Microbiol. 2016 Dec 9;6:179. doi: 10.3389/fcimb.2016.00179. eCollection 2016.

Phospholipid fatty acid (PLFA) analysis as a tool to estimate absolute abundances from compositional 16S rRNA bacterial metabarcoding data.

J Microbiol Methods. 2021 Sep;188:106271. doi: 10.1016/j.mimet.2021.106271. Epub 2021 Jun 17.

VITCOMIC2: visualization tool for the phylogenetic composition of microbial communities based on 16S rRNA gene amplicons and metagenomic shotgun sequencing.

BMC Syst Biol. 2018 Mar 19;12(Suppl 2):30. doi: 10.1186/s12918-018-0545-2.

Modified RNA-seq method for microbial community and diversity analysis using rRNA in different types of environmental samples.

PLoS One. 2017 Oct 10;12(10):e0186161. doi: 10.1371/journal.pone.0186161. eCollection 2017.

16Stimator: statistical estimation of ribosomal gene copy numbers from draft genome assemblies.

ISME J. 2016 Apr;10(4):1020-4. doi: 10.1038/ismej.2015.161. Epub 2015 Sep 11.

How, When, and Where Relic DNA Affects Microbial Diversity.

mBio. 2018 Jun 19;9(3):e00637-18. doi: 10.1128/mBio.00637-18.

Appl Environ Microbiol. 2016 Apr 18;82(9):2751-2762. doi: 10.1128/AEM.00247-16. Print 2016 May.

Correcting for 16S rRNA gene copy numbers in microbiome surveys remains an unsolved problem.

Microbiome. 2018 Feb 26;6(1):41. doi: 10.1186/s40168-018-0420-9.

引用本文的文献

SyFi: generating and using sequence fingerprints to distinguish SynCom isolates.

Microb Genom. 2025 Sep;11(9). doi: 10.1099/mgen.0.001461.

Detection of in amniotic fluids via reanalysis of prenatal copy number variation sequencing data: an exploratory study.

Front Cell Infect Microbiol. 2025 Aug 14;15:1579049. doi: 10.3389/fcimb.2025.1579049. eCollection 2025.

Cervicovaginal microbial features predict spread to the upper genital tract of infected women.

Infect Immun. 2025 Sep 9;93(9):e0005725. doi: 10.1128/iai.00057-25. Epub 2025 Aug 12.

Predicting gene distribution in ammonia-oxidizing archaea using phylogenetic signals.

ISME Commun. 2025 May 23;5(1):ycaf087. doi: 10.1093/ismeco/ycaf087. eCollection 2025 Jan.

Comparison of RNA- and DNA-based 16S amplicon sequencing to find the optimal approach for the analysis of the uterine microbiome.

Sci Rep. 2025 May 16;15(1):17037. doi: 10.1038/s41598-025-00969-5.

A curated bacterial and archaeal 16S rRNA Gene Oral Sequences dataset.

Sci Data. 2025 May 2;12(1):729. doi: 10.1038/s41597-025-05050-4.

The powerbend distribution provides a unified model for the species abundance distribution across animals, plants and microbes.

Nat Commun. 2025 Apr 29;16(1):4035. doi: 10.1038/s41467-025-59253-9.

DNA metabarcoding analysis revealed a silent prevalence of environmental pathogenic in urban area of Okinawa Island, Japan.

One Health. 2025 Mar 18;20:101016. doi: 10.1016/j.onehlt.2025.101016. eCollection 2025 Jun.

ISCAZIM: Integrated statistical correlation analysis for zero-inflated microbiome data.

Heliyon. 2024 Dec 18;11(1):e41184. doi: 10.1016/j.heliyon.2024.e41184. eCollection 2025 Jan 15.

Wise Roles and Future Visionary Endeavors of Current Emperor: Advancing Dynamic Methods for Longitudinal Microbiome Meta-Omics Data in Personalized and Precision Medicine.

Adv Sci (Weinh). 2024 Dec;11(47):e2400458. doi: 10.1002/advs.202400458. Epub 2024 Nov 13.

本文引用的文献

Chimeric 16S rRNA sequence formation and detection in Sanger and 454-pyrosequenced PCR amplicons.

Genome Res. 2011 Mar;21(3):494-504. doi: 10.1101/gr.112730.110. Epub 2011 Jan 6.

pplacer: linear time maximum-likelihood and Bayesian phylogenetic placement of sequences onto a fixed reference tree.

BMC Bioinformatics. 2010 Oct 30;11:538. doi: 10.1186/1471-2105-11-538.

Unlocking short read sequencing for metagenomics.

PLoS One. 2010 Jul 28;5(7):e11840. doi: 10.1371/journal.pone.0011840.

Picante: R tools for integrating phylogenies and ecology.

Bioinformatics. 2010 Jun 1;26(11):1463-4. doi: 10.1093/bioinformatics/btq166. Epub 2010 Apr 15.

QIIME allows analysis of high-throughput community sequencing data.

Nat Methods. 2010 May;7(5):335-6. doi: 10.1038/nmeth.f.303. Epub 2010 Apr 11.

A human gut microbial gene catalogue established by metagenomic sequencing.

Nature. 2010 Mar 4;464(7285):59-65. doi: 10.1038/nature08821.

A phylogeny-driven genomic encyclopaedia of Bacteria and Archaea.

Nature. 2009 Dec 24;462(7276):1056-60. doi: 10.1038/nature08656.

PyNAST: a flexible tool for aligning sequences to a template alignment.

Bioinformatics. 2010 Jan 15;26(2):266-7. doi: 10.1093/bioinformatics/btp636. Epub 2009 Nov 13.

Bacterial community variation in human body habitats across space and time.

Science. 2009 Dec 18;326(5960):1694-7. doi: 10.1126/science.1177486. Epub 2009 Nov 5.

Visualization of ribosomal RNA operon copy number distribution.

BMC Microbiol. 2009 Sep 25;9:208. doi: 10.1186/1471-2180-9-208.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

将 16S 基因拷贝数信息纳入其中可以提高对微生物多样性和丰度的估计。

Incorporating 16S gene copy number information improves estimates of microbial diversity and abundance.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献