Department of Biological Sciences, National University of Singapore, 16 Science Drive 4, Singapore, 117558, Singapore.
School of Marine and Environmental Affairs, University of Washington, 3707 Brooklyn Ave NE, Seattle, Washington, 98105, USA.
BMC Genomics. 2024 Sep 9;25(1):842. doi: 10.1186/s12864-024-10767-4.
DNA metabarcoding applies high-throughput sequencing approaches to generate numerous DNA barcodes from mixed sample pools for mass species identification and community characterisation. To date, however, most metabarcoding studies employ second-generation sequencing platforms like Illumina, which are limited by short read lengths and longer turnaround times. While third-generation platforms such as the MinION (Oxford Nanopore Technologies) can sequence longer reads and even in real-time, application of these platforms for metabarcoding has remained limited possibly due to the relatively high read error rates as well as the paucity of specialised software for processing such reads.
We show that this is no longer the case by performing nanopore-based, cytochrome c oxidase subunit I (COI) metabarcoding on 34 zooplankton bulk samples, and benchmarking the results against conventional Illumina MiSeq sequencing. Nanopore R10.3 sequencing chemistry and super accurate (SUP) basecalling model reduced raw read error rates to ~ 4%, and consensus calling with amplicon_sorter (without further error correction) generated metabarcodes that were ≤ 1% erroneous. Although Illumina recovered a higher number of molecular operational taxonomic units (MOTUs) than nanopore sequencing (589 vs. 471), we found no significant differences in the zooplankton communities inferred between the sequencing platforms. Importantly, 406 of 444 (91.4%) shared MOTUs between Illumina and nanopore were also found to be free of indel errors, and 85% of the zooplankton richness could be recovered after just 12-15 h of sequencing.
Our results demonstrate that nanopore sequencing can generate metabarcodes with Illumina-like accuracy, and we are the first study to show that nanopore metabarcodes are almost always indel-free. We also show that nanopore metabarcoding is viable for characterising species-rich communities rapidly, and that the same ecological conclusions can be obtained regardless of the sequencing platform used. Collectively, our study inspires confidence in nanopore sequencing and paves the way for greater utilisation of nanopore technology in various metabarcoding applications.
DNA 代谢组学应用高通量测序方法从混合样本池中生成大量 DNA 条码,用于大规模物种鉴定和群落特征分析。然而,迄今为止,大多数代谢组学研究都采用第二代测序平台,如 Illumina,其受到短读长和较长周转时间的限制。虽然第三代平台,如 MinION(Oxford Nanopore Technologies)可以进行更长的读长测序,甚至实时测序,但由于相对较高的读取错误率以及缺乏专门用于处理此类读取的软件,这些平台在代谢组学中的应用仍然有限。
我们通过对 34 个浮游动物总样本进行基于纳米孔的细胞色素 c 氧化酶亚基 I(COI)代谢组学研究,并将结果与传统的 Illumina MiSeq 测序进行基准测试,表明情况已不再如此。纳米孔 R10.3 测序化学和超级准确(SUP)碱基调用模型将原始读取错误率降低至~4%,并且使用 amplicon_sorter 进行共识调用(无需进一步纠错)生成的代谢条码错误率≤1%。虽然 Illumina 比纳米孔测序恢复的分子操作分类单元(MOTU)数量更高(589 对 471),但我们没有发现两种测序平台推断的浮游动物群落之间存在显著差异。重要的是,在 Illumina 和纳米孔之间有 444 个(91.4%)共享 MOTU 的情况下,没有发现插入缺失错误,并且在 12-15 小时的测序后可以恢复 85%的浮游动物丰富度。
我们的结果表明,纳米孔测序可以生成与 Illumina 类似准确性的代谢条码,并且我们是第一个表明纳米孔代谢条码几乎总是无插入缺失错误的研究。我们还表明,纳米孔代谢组学可以快速分析物种丰富的群落,并且无论使用哪种测序平台,都可以得出相同的生态结论。总的来说,我们的研究为纳米孔测序提供了信心,并为纳米孔技术在各种代谢组学应用中的更广泛应用铺平了道路。