Centre for Environmental Genomics Applications, eDNAtec Inc., St. John's, NL, Canada.
Centre for Biodiversity Genomics & Department of Integrative Biology, University of Guelph, Guelph, ON, Canada.
Sci Rep. 2019 Apr 12;9(1):5991. doi: 10.1038/s41598-019-42455-9.
The characterization of biodiversity is a crucial element of ecological investigations as well as environmental assessment and monitoring activities. Increasingly, amplicon-based environmental DNA metabarcoding (alternatively, marker gene metagenomics) is used for such studies given its ability to provide biodiversity data from various groups of organisms simply from analysis of bulk environmental samples such as water, soil or sediments. The Illumina MiSeq is currently the most popular tool for carrying out this work, but we set out to determine whether typical studies were reading enough DNA to detect rare organisms (i.e., those that may be of greatest interest such as endangered or invasive species) present in the environment. We collected sea water samples along two transects in Conception Bay, Newfoundland and analyzed them on the MiSeq with a sequencing depth of 100,000 reads per sample (exceeding the 60,000 per sample that is typical of similar studies). We then analyzed these same samples on Illumina's newest high-capacity platform, the NovaSeq, at a depth of 7 million reads per sample. Not surprisingly, the NovaSeq detected many more taxa than the MiSeq thanks to its much greater sequencing depth. However, contrary to our expectations this pattern was true even in depth-for-depth comparisons. In other words, the NovaSeq can detect more DNA sequence diversity within samples than the MiSeq, even at the exact same sequencing depth. Even when samples were reanalyzed on the MiSeq with a sequencing depth of 1 million reads each, the MiSeq's ability to detect new sequences plateaued while the NovaSeq continued to detect new sequence variants. These results have important biological implications. The NovaSeq found 40% more metazoan families in this environment than the MiSeq, including some of interest such as marine mammals and bony fish so the real-world implications of these findings are significant. These results are most likely associated to the advances incorporated in the NovaSeq, especially a patterned flow cell, which prevents similar sequences that are neighbours on the flow cell (common in metabarcoding studies) from being erroneously merged into single spots by the sequencing instrument. This study sets the stage for incorporating eDNA metabarcoding in comprehensive analysis of oceanic samples in a wide range of ecological and environmental investigations.
生物多样性的特征是生态调查以及环境评估和监测活动的关键要素。越来越多的情况下,基于扩增子的环境 DNA 宏条形码(或者说标记基因宏基因组学)被用于此类研究,因为它能够仅通过分析水样、土壤样或沉积物等批量环境样本,提供来自各种生物类群的生物多样性数据。Illumina MiSeq 目前是开展此类工作最常用的工具,但我们着手确定典型研究是否读取了足够的 DNA 来检测环境中存在的稀有生物(即那些最感兴趣的生物,如濒危或入侵物种)。我们沿着纽芬兰省康塞普申湾的两条横断带采集海水样本,并在 MiSeq 上进行分析,每个样本的测序深度为 100,000 个读数(超过类似研究中典型的 60,000 个读数)。然后,我们在 Illumina 最新的高通量平台 NovaSeq 上以每个样本 700 万个读数的深度进行了分析。毫不奇怪,NovaSeq 由于其更大的测序深度,检测到的分类单元比 MiSeq 多得多。然而,与我们的预期相反,即使在深度相等的比较中,也是如此。换句话说,即使在相同的测序深度下,NovaSeq 也可以在样本中检测到比 MiSeq 更多的 DNA 序列多样性。即使在 MiSeq 上对样本进行重新分析,每个样本的测序深度为 100 万个读数,MiSeq 检测新序列的能力也达到了顶峰,而 NovaSeq 仍在继续检测新的序列变体。这些结果具有重要的生物学意义。在这个环境中,NovaSeq 发现的后生动物科比 MiSeq 多 40%,包括一些感兴趣的科,如海洋哺乳动物和硬骨鱼,因此这些发现的实际意义重大。这些结果很可能与 NovaSeq 中包含的技术进步有关,特别是一种图案化的流动池,该流动池可防止流动池上相邻的(在宏条形码研究中很常见)相似序列被测序仪器错误地合并到单个斑点中。这项研究为在广泛的生态和环境调查中对海洋样本进行综合分析纳入 eDNA 宏条形码奠定了基础。