优化 DADA2 参数，用于从活性污泥中对 16S rRNA 基因进行多区域代谢组学分析，并比较分类能力和分类数据库。

Fine-Tuning of DADA2 Parameters for Multiregional Metabarcoding Analysis of 16S rRNA Genes from Activated Sludge and Comparison of Taxonomy Classification Power and Taxonomy Databases.

机构信息

Department of Plant Physiology, Genetics and Biotechnology, University of Warmia and Mazury in Olsztyn, 10-719 Olsztyn, Poland.

Department of Environmental Biotechnology, University of Warmia and Mazury in Olsztyn, 11-709 Olsztyn, Poland.

出版信息

Int J Mol Sci. 2024 Mar 20;25(6):3508. doi: 10.3390/ijms25063508.

DOI:10.3390/ijms25063508

PMID:38542482

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10971298/

Abstract

Taxonomic classification using metabarcoding is a commonly used method in microbiological studies of environmental samples and during monitoring of biotechnological processes. However, it is difficult to compare results from different laboratories, due to the variety of bioinformatics tools that have been developed and used for data analysis. This problem is compounded by different choices regarding which variable region of the gene and which database is used for taxonomic identification. Therefore, this study employed the DADA2 algorithm to optimize the preprocessing of raw data obtained from the sequencing of activated sludge samples, using simultaneous analysis of three frequently used regions of (V1-V3, V3-V4, V4-V5). Additionally, the study evaluated which variable region and which of the frequently used microbial databases for taxonomic classification (Greengenes2, Silva, RefSeq) more accurately classify OTUs into taxa. Adjusting the values of selected parameters of the DADA2 algorithm, we obtained the highest possible numbers of OTUs for each region. Regarding biodiversity within regions, the V3-V4 region had the highest Simpson and Shannon indexes, and the Chao1 index was similar to that of the V1-V3 region. Beta-biodiversity analysis revealed statistically significant differences between regions. When comparing databases for each of the regions studied, the highest numbers of taxonomic groups were obtained using the SILVA database. These results suggest that standardization of metabarcoding of short amplicons may be possible.

摘要

基于代谢组学的分类学分类是环境样本微生物学研究和生物技术过程监测中常用的方法。然而，由于已经开发并用于数据分析的生物信息学工具种类繁多，因此很难比较来自不同实验室的结果。这个问题因用于分类鉴定的基因的不同可变区和数据库的选择而更加复杂。因此，本研究采用 DADA2 算法来优化活性污泥样品测序获得的原始数据的预处理，同时分析（V1-V3、V3-V4、V4-V5）三个常用区域。此外，该研究评估了哪个可变区和哪些常用于分类的微生物数据库（Greengenes2、Silva、RefSeq）能更准确地将 OTU 分类为分类单元。通过调整 DADA2 算法的选定参数的值，我们为每个区域获得了尽可能多的 OTU。关于区域内的生物多样性，V3-V4 区域具有最高的 Simpson 和 Shannon 指数，而 Chao1 指数与 V1-V3 区域相似。β-生物多样性分析显示区域之间存在统计学上的显著差异。当比较每个研究区域的数据库时，使用 SILVA 数据库获得了最多的分类群数量。这些结果表明，短扩增子代谢组学的标准化可能是可行的。

相似文献

Fine-Tuning of DADA2 Parameters for Multiregional Metabarcoding Analysis of 16S rRNA Genes from Activated Sludge and Comparison of Taxonomy Classification Power and Taxonomy Databases.

Int J Mol Sci. 2024 Mar 20;25(6):3508. doi: 10.3390/ijms25063508.

Primer, Pipelines, Parameters: Issues in 16S rRNA Gene Sequencing.

mSphere. 2021 Feb 24;6(1):e01202-20. doi: 10.1128/mSphere.01202-20.

GSR-DB: a manually curated and optimized taxonomical database for 16S rRNA amplicon analysis.

mSystems. 2024 Feb 20;9(2):e0095023. doi: 10.1128/msystems.00950-23. Epub 2024 Jan 8.

Improving Species Level-taxonomic Assignment from 16S rRNA Sequencing Technologies.

Curr Protoc. 2023 Nov;3(11):e930. doi: 10.1002/cpz1.930.

rpoB, a promising marker for analyzing the diversity of bacterial communities by amplicon sequencing.

BMC Microbiol. 2019 Jul 29;19(1):171. doi: 10.1186/s12866-019-1546-z.

Determining the most accurate 16S rRNA hypervariable region for taxonomic identification from respiratory samples.

Sci Rep. 2023 Mar 9;13(1):3974. doi: 10.1038/s41598-023-30764-z.

Optimal 16S rRNA gene amplicon sequencing analysis for oral microbiota to avoid the potential bias introduced by trimming length, primer, and database.

Microbiol Spectr. 2024 Oct 22;12(12):e0351223. doi: 10.1128/spectrum.03512-23.

Comparing DNA isolation and sequencing strategies for 16S rRNA gene amplicon analysis in biofilm containing environments.

J Microbiol Methods. 2024 May;220:106921. doi: 10.1016/j.mimet.2024.106921. Epub 2024 Mar 16.

The bias associated with amplicon sequencing does not affect the quantitative assessment of bacterial community dynamics.

PLoS One. 2014 Jun 12;9(6):e99722. doi: 10.1371/journal.pone.0099722. eCollection 2014.

A multi-amplicon 16S rRNA sequencing and analysis method for improved taxonomic profiling of bacterial communities.

J Microbiol Methods. 2018 Nov;154:6-13. doi: 10.1016/j.mimet.2018.09.019. Epub 2018 Sep 29.

引用本文的文献

Special Issue "Current Research on Omics of Microorganisms".

Int J Mol Sci. 2025 Sep 3;26(17):8553. doi: 10.3390/ijms26178553.

本文引用的文献

Author Correction: Greengenes2 unifies microbial data in a single reference tree.

Nat Biotechnol. 2024 May;42(5):813. doi: 10.1038/s41587-023-02026-w.

Greengenes2 unifies microbial data in a single reference tree.

Nat Biotechnol. 2024 May;42(5):715-718. doi: 10.1038/s41587-023-01845-1. Epub 2023 Jul 27.

Comparative study of multiple approaches for identifying cultivable microalgae population diversity from freshwater samples.

PLoS One. 2023 Jul 7;18(7):e0285913. doi: 10.1371/journal.pone.0285913. eCollection 2023.

Evaluation of DNA extraction methods and direct PCR in metabarcoding of mock and marine bacterial communities.

Front Microbiol. 2023 Apr 17;14:1151907. doi: 10.3389/fmicb.2023.1151907. eCollection 2023.

Determining the most accurate 16S rRNA hypervariable region for taxonomic identification from respiratory samples.

Sci Rep. 2023 Mar 9;13(1):3974. doi: 10.1038/s41598-023-30764-z.

Influence of seasonality, wastewater treatment plant process, geographical location and environmental parameters on bacterial community selection in activated sludge wastewater treatment plants treating municipal sewage in South Africa.

Environ Res. 2023 Apr 1;222:115394. doi: 10.1016/j.envres.2023.115394. Epub 2023 Jan 30.

Multi-amplicon microbiome data analysis pipelines for mixed orientation sequences using QIIME2: Assessing reference database, variable region and pre-processing bias in classification of mock bacterial community samples.

PLoS One. 2023 Jan 13;18(1):e0280293. doi: 10.1371/journal.pone.0280293. eCollection 2023.

Correction to 'The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2022 update'.

Nucleic Acids Res. 2022 Aug 26;50(15):8999. doi: 10.1093/nar/gkac610.

RESCRIPt: Reproducible sequence taxonomy reference database management.

PLoS Comput Biol. 2021 Nov 8;17(11):e1009581. doi: 10.1371/journal.pcbi.1009581. eCollection 2021 Nov.

From the Andes to the desert: 16S rRNA metabarcoding characterization of aquatic bacterial communities in the Rimac river, the main source of water for Lima, Peru.

PLoS One. 2021 Apr 22;16(4):e0250401. doi: 10.1371/journal.pone.0250401. eCollection 2021.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

优化 DADA2 参数，用于从活性污泥中对 16S rRNA 基因进行多区域代谢组学分析，并比较分类能力和分类数据库。

Fine-Tuning of DADA2 Parameters for Multiregional Metabarcoding Analysis of 16S rRNA Genes from Activated Sludge and Comparison of Taxonomy Classification Power and Taxonomy Databases.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献