Benchmarking long-read sequencing strategies for obtaining ASV-resolved rRNA operons from environmental microeukaryotes.

Author information

Center for Microbial Communities, Department of Chemistry and Bioscience, Aalborg University, Aalborg, Denmark.

Department of Organismal Biology (Systematic Biology), Uppsala University, Uppsala, Sweden.

Publication information

Mol Ecol Resour. 2024 Oct;24(7):e13991. doi: 10.1111/1755-0998.13991. Epub 2024 Jul 9.

DOI:10.1111/1755-0998.13991

PMID:38979877

Abstract

The use of short-read metabarcoding for classifying microeukaryotes is challenged by the lack of comprehensive 18S rRNA reference databases. While recent advances in high-throughput long-read sequencing provide the potential to greatly increase the phylogenetic coverage of these databases, the performance of different sequencing technologies and subsequent bioinformatics processing remains to be evaluated, primarily because of the absence of well-defined eukaryotic mock communities. To address this challenge, we created a eukaryotic rRNA operon clone library and turned it into a precisely defined synthetic eukaryotic mock community. This mock community was then used to evaluate the performance of three long-read sequencing strategies (PacBio circular consensus sequencing and two Nanopore approaches using unique molecular identifiers) and three tools for resolving amplicon sequence variants (ASVs) (USEARCH, VSEARCH, and DADA2). We investigated the sensitivity of the sequencing techniques based on the number of detected mock taxa, and the accuracy of the different ASV-calling tools with a specific focus on the presence of chimeras among the final rRNA operon ASVs. Based on our findings, we provide recommendations and best-practice protocols for cost-effectively obtaining essentially error-free rRNA operons at high throughput. An agricultural soil sample was used to demonstrate that the sequencing and bioinformatic results from the mock community also translate to highly diverse natural samples, which enables us to identify previously undescribed microeukaryotic lineages.

Similar articles

The effect of metabarcoding 18S rRNA region choice on diversity of microeukaryotes including phytoplankton.

World J Microbiol Biotechnol. 2023 Jun 21;39(9):229. doi: 10.1007/s11274-023-03678-1.

Short- and long-read metabarcoding of the eukaryotic rRNA operon: Evaluation of primers and comparison to shotgun metagenomics sequencing.

Mol Ecol Resour. 2022 Aug;22(6):2304-2318. doi: 10.1111/1755-0998.13623. Epub 2022 May 6.

Microbial Identification Using rRNA Operon Region: Database and Tool for Metataxonomics with Long-Read Sequence.

Microbiol Spectr. 2022 Apr 27;10(2):e0201721. doi: 10.1128/spectrum.02017-21. Epub 2022 Mar 30.

Impact of DNA extraction, PCR amplification, sequencing, and bioinformatic analysis on food-associated mock communities using PacBio long-read amplicon sequencing.

BMC Microbiol. 2024 Dec 6;24(1):521. doi: 10.1186/s12866-024-03677-8.

Long-read metabarcoding of the eukaryotic rDNA operon to phylogenetically and taxonomically resolve environmental diversity.

Mol Ecol Resour. 2020 Mar;20(2):429-443. doi: 10.1111/1755-0998.13117. Epub 2019 Nov 29.

Confident phylogenetic identification of uncultured prokaryotes through long read amplicon sequencing of the 16S-ITS-23S rRNA operon.

Environ Microbiol. 2019 Jul;21(7):2485-2498. doi: 10.1111/1462-2920.14636. Epub 2019 May 7.

Long-read DNA metabarcoding of ribosomal RNA in the analysis of fungi from aquatic environments.

Mol Ecol Resour. 2018 Nov;18(6):1500-1514. doi: 10.1111/1755-0998.12937. Epub 2018 Sep 23.

The Ribosomal Operon Database: A Full-Length rDNA Operon Database Derived From Genome Assemblies.

Mol Ecol Resour. 2025 Jan;25(1):e14031. doi: 10.1111/1755-0998.14031. Epub 2024 Oct 21.

The newest Oxford Nanopore R10.4.1 full-length 16S rRNA sequencing enables the accurate resolution of species-level microbial community profiling.

Appl Environ Microbiol. 2023 Oct 31;89(10):e0060523. doi: 10.1128/aem.00605-23. Epub 2023 Oct 6.
