分析细菌16S rRNA基因测序数据时整合来自多个高变区的数据

Incorporation of Data From Multiple Hypervariable Regions when Analyzing Bacterial 16S rRNA Gene Sequencing Data.

作者信息

Jones Carli B, White James R, Ernst Sarah E, Sfanos Karen S, Peiffer Lauren B

机构信息

Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, MD, United States.

Resphera Biosciences, Baltimore, MD, United States.

出版信息

Front Genet. 2022 Mar 31;13:799615. doi: 10.3389/fgene.2022.799615. eCollection 2022.

DOI:10.3389/fgene.2022.799615

PMID:35432480

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9009396/

Abstract

Short read 16 S rRNA amplicon sequencing is a common technique used in microbiome research. However, inaccuracies in estimated bacterial community composition can occur due to amplification bias of the targeted hypervariable region. A potential solution is to sequence and assess multiple hypervariable regions in tandem, yet there is currently no consensus as to the appropriate method for analyzing this data. Additionally, there are many sequence analysis resources for data produced from the Illumina platform, but fewer open-source options available for data from the Ion Torrent platform. Herein, we present an analysis pipeline using open-source analysis platforms that integrates data from multiple hypervariable regions and is compatible with data produced from the Ion Torrent platform. We used the ThermoFisher Ion 16 S Metagenomics Kit and a mock community of twenty bacterial strains to assess taxonomic classification of six amplicons from separate hypervariable regions (V2, V3, V4, V6-7, V8, V9) using our analysis pipeline. We report that different amplicons have different specificities for taxonomic classification, which also has implications for global level analyses such as alpha and beta diversity. Finally, we utilize a generalized linear modeling approach to statistically integrate the results from multiple hypervariable regions and apply this methodology to data from a representative clinical cohort. We conclude that examining sequencing results across multiple hypervariable regions provides more taxonomic information than sequencing across a single region. The data across multiple hypervariable regions can be combined using generalized linear models to enhance the statistical evaluation of overall differences in community structure and relatedness among sample groups.

摘要

短读长16S rRNA扩增子测序是微生物组研究中常用的技术。然而，由于目标高变区的扩增偏差，可能会出现估计细菌群落组成的不准确情况。一种潜在的解决方案是串联测序和评估多个高变区，但目前对于分析此数据的合适方法尚无共识。此外，有许多针对Illumina平台产生的数据的序列分析资源，但针对Ion Torrent平台数据的开源选项较少。在此，我们展示了一种使用开源分析平台的分析流程，该流程整合了来自多个高变区的数据，并且与Ion Torrent平台产生的数据兼容。我们使用赛默飞世尔Ion 16S宏基因组学试剂盒和一个包含20种细菌菌株的模拟群落，通过我们的分析流程评估来自不同高变区（V2、V3、V4、V6 - 7、V8、V9）的六个扩增子的分类学分类。我们报告不同的扩增子在分类学分类上具有不同的特异性，这对诸如α和β多样性等全局水平分析也有影响。最后，我们利用广义线性建模方法对来自多个高变区的结果进行统计整合，并将此方法应用于来自一个代表性临床队列的数据。我们得出结论，与对单个区域进行测序相比，对多个高变区的测序结果进行检查可提供更多的分类学信息。来自多个高变区的数据可以使用广义线性模型进行合并，以增强对群落结构总体差异和样本组间相关性的统计评估。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6d3c/9009396/f4f25762c6a4/fgene-13-799615-g001.jpg

相似文献

Incorporation of Data From Multiple Hypervariable Regions when Analyzing Bacterial 16S rRNA Gene Sequencing Data.

Front Genet. 2022 Mar 31;13:799615. doi: 10.3389/fgene.2022.799615. eCollection 2022.

Development of an Analysis Pipeline Characterizing Multiple Hypervariable Regions of 16S rRNA Using Mock Samples.

PLoS One. 2016 Feb 1;11(2):e0148047. doi: 10.1371/journal.pone.0148047. eCollection 2016.

Multi-amplicon microbiome data analysis pipelines for mixed orientation sequences using QIIME2: Assessing reference database, variable region and pre-processing bias in classification of mock bacterial community samples.

PLoS One. 2023 Jan 13;18(1):e0280293. doi: 10.1371/journal.pone.0280293. eCollection 2023.

Determining the most accurate 16S rRNA hypervariable region for taxonomic identification from respiratory samples.

Sci Rep. 2023 Mar 9;13(1):3974. doi: 10.1038/s41598-023-30764-z.

Analysis of the effect of smoking on the buccal microbiome using next-generation sequencing technology.

J Med Microbiol. 2019 Aug;68(8):1148-1158. doi: 10.1099/jmm.0.001003. Epub 2019 Jun 14.

Comparison of different hypervariable regions of 16S rRNA for taxonomic profiling of vaginal microbiota using next-generation sequencing.

Arch Microbiol. 2021 Apr;203(3):1159-1166. doi: 10.1007/s00203-020-02114-4. Epub 2020 Nov 22.

Primer, Pipelines, Parameters: Issues in 16S rRNA Gene Sequencing.

mSphere. 2021 Feb 24;6(1):e01202-20. doi: 10.1128/mSphere.01202-20.

rpoB, a promising marker for analyzing the diversity of bacterial communities by amplicon sequencing.

BMC Microbiol. 2019 Jul 29;19(1):171. doi: 10.1186/s12866-019-1546-z.

NG-Tax, a highly accurate and validated pipeline for analysis of 16S rRNA amplicons from complex biomes.

F1000Res. 2016 Jul 22;5:1791. doi: 10.12688/f1000research.9227.2. eCollection 2016.

Design and Evaluation of Illumina MiSeq-Compatible, 18S rRNA Gene-Specific Primers for Improved Characterization of Mixed Phototrophic Communities.

Appl Environ Microbiol. 2016 Sep 16;82(19):5878-91. doi: 10.1128/AEM.01630-16. Print 2016 Oct 1.

引用本文的文献

Targeted 16S rRNA Gene Sequencing for Water Samples.

Methods Mol Biol. 2025;2962:65-82. doi: 10.1007/978-1-0716-4726-4_6.

State-of-the-art approaches in the investigation of human seminal bacteriome using metagenomic methods.

Front Reprod Health. 2025 Jun 5;7:1557912. doi: 10.3389/frph.2025.1557912. eCollection 2025.

Standardizing a microbiome pipeline for body fluid identification from complex crime scene stains.

Appl Environ Microbiol. 2025 May 21;91(5):e0187124. doi: 10.1128/aem.01871-24. Epub 2025 Apr 30.

The role of the urinary microbiome in genitourinary cancers.

Nat Rev Urol. 2025 Mar 13. doi: 10.1038/s41585-025-01011-z.

Captive-rearing changes the gut microbiota of the bumblebee native to China.

PeerJ. 2025 Feb 13;13:e18964. doi: 10.7717/peerj.18964. eCollection 2025.

Association between gut microbiome profiles and host metabolic health across the life course: a population-based study.

Lancet Reg Health Eur. 2024 Dec 28;50:101195. doi: 10.1016/j.lanepe.2024.101195. eCollection 2025 Mar.

Association Between Scalp Microbiota Imbalance, Disease Severity, and Systemic Inflammatory Markers in Alopecia Areata.

Dermatol Ther (Heidelb). 2024 Nov;14(11):2971-2986. doi: 10.1007/s13555-024-01281-2. Epub 2024 Oct 10.

Current progresses and challenges for microbiome research in human health: a perspective.

Front Cell Infect Microbiol. 2024 Apr 4;14:1377012. doi: 10.3389/fcimb.2024.1377012. eCollection 2024.

Microbiome single cell atlases generated with a commercial instrument.

Res Sq. 2023 Sep 14:rs.3.rs-3253785. doi: 10.21203/rs.3.rs-3253785/v1.

The potential role of the microbiota in prostate cancer pathogenesis and treatment.

Nat Rev Urol. 2023 Dec;20(12):706-718. doi: 10.1038/s41585-023-00795-2. Epub 2023 Jul 25.

本文引用的文献

Classification of 16S rRNA reads is improved using a niche-specific database constructed by near-full length sequencing.

PLoS One. 2020 Jul 13;15(7):e0235498. doi: 10.1371/journal.pone.0235498. eCollection 2020.

q2-sample-classifier: machine-learning tools for microbiome classification and regression.

J Open Res Softw. 2018;3(30). doi: 10.21105/joss.00934. Epub 2018 Oct 23.

Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2.

Nat Biotechnol. 2019 Aug;37(8):852-857. doi: 10.1038/s41587-019-0209-9.

A human gut bacterial genome and culture collection for improved metagenomic analyses.

Nat Biotechnol. 2019 Feb;37(2):186-192. doi: 10.1038/s41587-018-0009-7. Epub 2019 Feb 4.

Altering the Gut Microbiome of Cattle: Considerations of Host-Microbiome Interactions for Persistent Microbiome Manipulation.

Microb Ecol. 2019 Feb;77(2):523-536. doi: 10.1007/s00248-018-1234-9. Epub 2018 Jul 22.

Combining 16S rRNA gene variable regions enables high-resolution microbial community profiling.

Microbiome. 2018 Jan 26;6(1):17. doi: 10.1186/s40168-017-0396-x.

Shotgun metagenomics, from sampling to analysis.

Nat Biotechnol. 2017 Sep 12;35(9):833-844. doi: 10.1038/nbt.3935.

Profiling the Urinary Microbiome in Men with Positive versus Negative Biopsies for Prostate Cancer.

J Urol. 2018 Jan;199(1):161-171. doi: 10.1016/j.juro.2017.08.001. Epub 2017 Aug 7.

DADA2: High-resolution sample inference from Illumina amplicon data.

Nat Methods. 2016 Jul;13(7):581-3. doi: 10.1038/nmeth.3869. Epub 2016 May 23.

Development of an Analysis Pipeline Characterizing Multiple Hypervariable Regions of 16S rRNA Using Mock Samples.

PLoS One. 2016 Feb 1;11(2):e0148047. doi: 10.1371/journal.pone.0148047. eCollection 2016.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

分析细菌16S rRNA基因测序数据时整合来自多个高变区的数据

Incorporation of Data From Multiple Hypervariable Regions when Analyzing Bacterial 16S rRNA Gene Sequencing Data.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献