引物、流程、参数：16S rRNA 基因测序中的问题。

Primer, Pipelines, Parameters: Issues in 16S rRNA Gene Sequencing.

机构信息

Core Facility Microbiome, ZIEL-Institute for Food & Health, Technische Universität München, Freising, Germany.

Chair of Experimental Bioinformatics, TUM School of Life Sciences Weihenstephan, Technische Universität München, Freising, Germany.

出版信息

mSphere. 2021 Feb 24;6(1):e01202-20. doi: 10.1128/mSphere.01202-20.

DOI:10.1128/mSphere.01202-20

PMID:33627512

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8544895/

Abstract

Short-amplicon 16S rRNA gene sequencing is currently the method of choice for studies investigating microbiomes. However, comparative studies on differences in procedures are scarce. We sequenced human stool samples and mock communities with increasing complexity using a variety of commonly used protocols. Short amplicons targeting different variable regions (V-regions) or ranges thereof (V1-V2, V1-V3, V3-V4, V4, V4-V5, V6-V8, and V7-V9) were investigated for differences in the composition outcome due to primer choices. Next, the influence of clustering (operational taxonomic units [OTUs], zero-radius OTUs [zOTUs], and amplicon sequence variants [ASVs]), different databases (GreenGenes, the Ribosomal Database Project, Silva, the genomic-based 16S rRNA Database, and The All-Species Living Tree), and bioinformatic settings on taxonomic assignment were also investigated. We present a systematic comparison across all typically used V-regions using well-established primers. While it is known that the primer choice has a significant influence on the resulting microbial composition, we show that microbial profiles generated using different primer pairs need independent validation of performance. Further, comparing data sets across V-regions using different databases might be misleading due to differences in nomenclature (e.g., versus ) and varying precisions in classification down to genus level. Overall, specific but important taxa are not picked up by certain primer pairs (e.g., is missed using primers 515F-944R) or due to the database used (e.g., in GreenGenes and the genomic-based 16S rRNA Database). We found that appropriate truncation of amplicons is essential and different truncated-length combinations should be tested for each study. Finally, specific mock communities of sufficient and adequate complexity are highly recommended. In 16S rRNA gene sequencing, certain bacterial genera were found to be underrepresented or even missing in taxonomic profiles when using unsuitable primer combinations, outdated reference databases, or inadequate pipeline settings. Concerning the last, quality thresholds as well as bioinformatic settings (i.e., clustering approach, analysis pipeline, and specific adjustments such as truncation) are responsible for a number of observed differences between studies. Conclusions drawn by comparing one data set to another (e.g., between publications) appear to be problematic and require independent cross-validation using matching V-regions and uniform data processing. Therefore, we highlight the importance of a thought-out study design including sufficiently complex mock standards and appropriate V-region choice for the sample of interest. The use of processing pipelines and parameters must be tested beforehand.

摘要

短扩增子 16S rRNA 基因测序目前是研究微生物组的首选方法。然而，关于程序差异的比较研究很少。我们使用各种常用的方案对越来越复杂的人类粪便样本和模拟群落进行了测序。我们研究了针对不同可变区（V 区）或其范围（V1-V2、V1-V3、V3-V4、V4、V4-V5、V6-V8 和 V7-V9）的短引物在组成结果上的差异，因为引物的选择会导致差异。接下来，我们还研究了聚类（操作分类单元[OTU]、零半径 OTU[zOTU]和扩增子序列变体[ASV]）、不同数据库（GreenGenes、核糖体数据库项目、Silva、基于基因组的 16S rRNA 数据库和所有物种生命树）以及生物信息学设置对分类分配的影响。我们使用经过验证的引物对所有常用的 V 区进行了系统比较。虽然已知引物选择对微生物组成的结果有重大影响，但我们表明，使用不同引物对生成的微生物谱需要独立验证其性能。此外，由于命名法（例如，和）和分类到属级别的精度不同，使用不同数据库在 V 区之间比较数据集可能会产生误导。总体而言，某些特定但重要的分类群可能会被某些引物对漏掉（例如，引物 515F-944R 不会检测到）或由于使用的数据库（例如，GreenGenes 和基于基因组的 16S rRNA 数据库中没有）。我们发现，适当的扩增子截断是必不可少的，并且应该针对每个研究测试不同的截断长度组合。最后，强烈推荐使用足够和适当复杂的模拟群落。在 16S rRNA 基因测序中，当使用不合适的引物组合、过时的参考数据库或不充分的管道设置时，某些细菌属在分类图谱中被低估或甚至缺失。关于最后一点，质量阈值以及生物信息学设置（即聚类方法、分析管道以及特定调整，如截断）负责解释许多研究之间的差异。通过将一个数据集与另一个数据集（例如，在出版物之间）进行比较得出的结论似乎存在问题，需要使用匹配的 V 区和统一的数据处理进行独立的交叉验证。因此，我们强调了在研究设计中包含足够复杂的模拟标准和适当的 V 区选择的重要性，以便对感兴趣的样本进行研究。在进行测序之前，必须对处理管道和参数进行测试。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a7e/8544895/c1813486c059/msphere.01202-20-f0001.jpg

相似文献

Primer, Pipelines, Parameters: Issues in 16S rRNA Gene Sequencing.

mSphere. 2021 Feb 24;6(1):e01202-20. doi: 10.1128/mSphere.01202-20.

Impact of DNA Sequencing and Analysis Methods on 16S rRNA Gene Bacterial Community Analysis of Dairy Products.

mSphere. 2018 Oct 17;3(5):e00410-18. doi: 10.1128/mSphere.00410-18.

Optimisation of methods for bacterial skin microbiome investigation: primer selection and comparison of the 454 versus MiSeq platform.

BMC Microbiol. 2017 Jan 21;17(1):23. doi: 10.1186/s12866-017-0927-4.

GSR-DB: a manually curated and optimized taxonomical database for 16S rRNA amplicon analysis.

mSystems. 2024 Feb 20;9(2):e0095023. doi: 10.1128/msystems.00950-23. Epub 2024 Jan 8.

Optimisation of 16S rRNA gut microbiota profiling of extremely low birth weight infants.

BMC Genomics. 2017 Nov 2;18(1):841. doi: 10.1186/s12864-017-4229-x.

Primer Design for an Accurate View of Picocyanobacterial Community Structure by Using High-Throughput Sequencing.

Appl Environ Microbiol. 2019 Mar 22;85(7). doi: 10.1128/AEM.02659-18. Print 2019 Apr 1.

Multi-amplicon microbiome data analysis pipelines for mixed orientation sequences using QIIME2: Assessing reference database, variable region and pre-processing bias in classification of mock bacterial community samples.

PLoS One. 2023 Jan 13;18(1):e0280293. doi: 10.1371/journal.pone.0280293. eCollection 2023.

Determining the most accurate 16S rRNA hypervariable region for taxonomic identification from respiratory samples.

Sci Rep. 2023 Mar 9;13(1):3974. doi: 10.1038/s41598-023-30764-z.

A multi-amplicon 16S rRNA sequencing and analysis method for improved taxonomic profiling of bacterial communities.

J Microbiol Methods. 2018 Nov;154:6-13. doi: 10.1016/j.mimet.2018.09.019. Epub 2018 Sep 29.

Evaluation of 16S rRNA amplicon sequencing using two next-generation sequencing technologies for phylogenetic analysis of the rumen bacterial community in steers.

J Microbiol Methods. 2016 Aug;127:132-140. doi: 10.1016/j.mimet.2016.06.004. Epub 2016 Jun 6.

引用本文的文献

Next-generation sequencing applications in food science: fundamentals and recent advances.

Front Bioeng Biotechnol. 2025 Aug 20;13:1638957. doi: 10.3389/fbioe.2025.1638957. eCollection 2025.

A quartet-based approach for inferring phylogenetically informative features from genomic and phenomic data.

Comput Struct Biotechnol J. 2025 Aug 22;27:3710-3718. doi: 10.1016/j.csbj.2025.08.015. eCollection 2025.

Why Are Long-Read Sequencing Methods Revolutionizing Microbiome Analysis?

Microorganisms. 2025 Aug 9;13(8):1861. doi: 10.3390/microorganisms13081861.

Cervicovaginal Microbiome and HPV: A Standardized Approach to 16S/ITS NGS and Microbial Community Profiling for Viral Association.

Int J Mol Sci. 2025 Aug 21;26(16):8090. doi: 10.3390/ijms26168090.

Revolutionizing cancer treatment with Halomonas Aquamarina L-Glutaminase: insights from in vitro and computational studies.

Sci Rep. 2025 Aug 24;15(1):31086. doi: 10.1038/s41598-025-14230-6.

Benchmarking 16S rRNA Gene-Based Approaches to Bacterial Taxonomy Assignment Based on Amplicon Sequencing With Illumina and Oxford Nanopore.

Int J Microbiol. 2025 Aug 13;2025:7563096. doi: 10.1155/ijm/7563096. eCollection 2025.

Comparative evaluation of sequencing platforms: Pacific Biosciences, Oxford Nanopore Technologies, and Illumina for 16S rRNA-based soil microbiome profiling.

Front Microbiol. 2025 Aug 6;16:1633360. doi: 10.3389/fmicb.2025.1633360. eCollection 2025.

Stabilized and unstabilized sampling methods result in differential fecal 16S rRNA microbial sequencing results.

PLoS One. 2025 Aug 13;20(8):e0324351. doi: 10.1371/journal.pone.0324351. eCollection 2025.

Comparative effects of raw and processed cistanche glycosides on the HPT axis and gut microbiota in a rat model of kidney-yang deficiency.

Front Pharmacol. 2025 Jul 25;16:1597564. doi: 10.3389/fphar.2025.1597564. eCollection 2025.

Metagenomic analysis reveals methanogenic and other archaeal genes in the digestive tract of invasive Japanese beetle larvae and associated soil.

Front Microbiol. 2025 Jul 25;16:1609893. doi: 10.3389/fmicb.2025.1609893. eCollection 2025.

本文引用的文献

Comparing Circadian Rhythmicity in the Human Gut Microbiome.

STAR Protoc. 2020 Oct 26;1(3):100148. doi: 10.1016/j.xpro.2020.100148. eCollection 2020 Dec 18.

Generation of Comprehensive Ecosystem-Specific Reference Databases with Species-Level Resolution by High-Throughput Full-Length 16S rRNA Gene Sequencing and Automated Taxonomy Assignment (AutoTax).

mBio. 2020 Sep 22;11(5):e01557-20. doi: 10.1128/mBio.01557-20.

The Influences of Bioinformatics Tools and Reference Databases in Analyzing the Human Oral Microbial Community.

Genes (Basel). 2020 Aug 3;11(8):878. doi: 10.3390/genes11080878.

Comparison of Bioinformatics Pipelines and Operating Systems for the Analyses of 16S rRNA Gene Amplicon Sequences in Human Fecal Samples.

Front Microbiol. 2020 Jun 17;11:1262. doi: 10.3389/fmicb.2020.01262. eCollection 2020.

Arrhythmic Gut Microbiome Signatures Predict Risk of Type 2 Diabetes.

Cell Host Microbe. 2020 Aug 12;28(2):258-272.e6. doi: 10.1016/j.chom.2020.06.004. Epub 2020 Jul 2.

Variations of Gut Microbiome Profile Under Different Storage Conditions and Preservation Periods: A Multi-Dimensional Evaluation.

Front Microbiol. 2020 May 27;11:972. doi: 10.3389/fmicb.2020.00972. eCollection 2020.

Construction of habitat-specific training sets to achieve species-level assignment in 16S rRNA gene datasets.

Microbiome. 2020 May 15;8(1):65. doi: 10.1186/s40168-020-00841-w.

The nf-core framework for community-curated bioinformatics pipelines.

Nat Biotechnol. 2020 Mar;38(3):276-278. doi: 10.1038/s41587-020-0439-x.

Toward Standards in Clinical Microbiota Studies: Comparison of Three DNA Extraction Methods and Two Bioinformatic Pipelines.

mSystems. 2020 Feb 11;5(1):e00547-19. doi: 10.1128/mSystems.00547-19.

Comparison of five assays for DNA extraction from bacterial cells in human faecal samples.

J Appl Microbiol. 2020 Aug;129(2):378-388. doi: 10.1111/jam.14608. Epub 2020 Feb 20.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

引物、流程、参数：16S rRNA 基因测序中的问题。

Primer, Pipelines, Parameters: Issues in 16S rRNA Gene Sequencing.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献