随机抽样导致在Illumina COI宏条形码分析中稀有真核生物操作分类单元的重现性较低。

Random sampling causes the low reproducibility of rare eukaryotic OTUs in Illumina COI metabarcoding.

作者信息

Leray Matthieu, Knowlton Nancy

机构信息

National Museum of Natural History, Smithsonian Institution, Washington, D.C., USA; Smithsonian Tropical Research Institute, Smithsonian Institution, Panama City, Balboa, Ancon, Republic of Panama.

National Museum of Natural History, Smithsonian Institution , Washington , D.C. , USA.

出版信息

PeerJ. 2017 Mar 22;5:e3006. doi: 10.7717/peerj.3006. eCollection 2017.

DOI:10.7717/peerj.3006

PMID:28348924

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5364921/

Abstract

DNA metabarcoding, the PCR-based profiling of natural communities, is becoming the method of choice for biodiversity monitoring because it circumvents some of the limitations inherent to traditional ecological surveys. However, potential sources of bias that can affect the reproducibility of this method remain to be quantified. The interpretation of differences in patterns of sequence abundance and the ecological relevance of rare sequences remain particularly uncertain. Here we used one artificial mock community to explore the significance of abundance patterns and disentangle the effects of two potential biases on data reproducibility: indexed PCR primers and random sampling during Illumina MiSeq sequencing. We amplified a short fragment of the mitochondrial Cytochrome c Oxidase Subunit I (COI) for a single mock sample containing equimolar amounts of total genomic DNA from 34 marine invertebrates belonging to six phyla. We used seven indexed broad-range primers and sequenced the resulting library on two consecutive Illumina MiSeq runs. The total number of Operational Taxonomic Units (OTUs) was ∼4 times higher than expected based on the composition of the mock sample. Moreover, the total number of reads for the 34 components of the mock sample differed by up to three orders of magnitude. However, 79 out of 86 of the unexpected OTUs were represented by <10 sequences that did not appear consistently across replicates. Our data suggest that random sampling of rare OTUs (e.g., small associated fauna such as parasites) accounted for most of variation in OTU presence-absence, whereas biases associated with indexed PCRs accounted for a larger amount of variation in relative abundance patterns. These results suggest that random sampling during sequencing leads to the low reproducibility of rare OTUs. We suggest that the strategy for handling rare OTUs should depend on the objectives of the study. Systematic removal of rare OTUs may avoid inflating diversity based on common descriptors but will exclude positive records of taxa that are functionally important. Our results further reinforce the need for technical replicates (parallel PCR and sequencing from the same sample) in metabarcoding experimental designs. Data reproducibility should be determined empirically as it will depend upon the sequencing depth, the type of sample, the sequence analysis pipeline, and the number of replicates. Moreover, estimating relative biomasses or abundances based on read counts remains elusive at the OTU level.

摘要

DNA 宏条形码技术，即基于聚合酶链式反应（PCR）对自然群落进行特征分析，正成为生物多样性监测的首选方法，因为它规避了传统生态调查中固有的一些局限性。然而，可能影响该方法可重复性的潜在偏差来源仍有待量化。序列丰度模式差异的解读以及稀有序列的生态相关性仍然特别不确定。在这里，我们使用一个人工模拟群落来探讨丰度模式的重要性，并剖析两种潜在偏差对数据可重复性的影响：索引 PCR 引物和 Illumina MiSeq 测序过程中的随机抽样。我们针对一个模拟样本扩增了线粒体细胞色素 c 氧化酶亚基 I（COI）的短片段，该样本包含来自六个门的 34 种海洋无脊椎动物等摩尔量的总基因组 DNA。我们使用了七种索引宽范围引物，并在连续两次 Illumina MiSeq 运行中对所得文库进行测序。操作分类单元（OTU）的总数比基于模拟样本组成预期的高出约 4 倍。此外，模拟样本的 34 个组分的读取总数相差高达三个数量级。然而，86 个意外 OTU 中的 79 个由少于 10 条序列代表，这些序列在重复样本中并非始终出现。我们的数据表明，稀有 OTU（例如寄生虫等小型伴生动物群）的随机抽样占 OTU 存在与否变化的大部分，而与索引 PCR 相关的偏差在相对丰度模式变化中占比更大。这些结果表明，测序过程中的随机抽样导致稀有 OTU 的可重复性较低。我们建议处理稀有 OTU 的策略应取决于研究目的。系统去除稀有 OTU 可能避免基于常见描述符夸大多样性，但会排除功能上重要的分类群的阳性记录。我们的结果进一步强调了在宏条形码实验设计中进行技术重复（从同一样本进行平行 PCR 和测序）的必要性。数据可重复性应根据经验确定，因为它将取决于测序深度、样本类型、序列分析流程和重复次数。此外，在 OTU 水平上基于读取计数估计相对生物量或丰度仍然难以实现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/722f/5364921/702f6e97ad4d/peerj-05-3006-g001.jpg

相似文献

Random sampling causes the low reproducibility of rare eukaryotic OTUs in Illumina COI metabarcoding.随机抽样导致在Illumina COI宏条形码分析中稀有真核生物操作分类单元的重现性较低。

PeerJ. 2017 Mar 22;5:e3006. doi: 10.7717/peerj.3006. eCollection 2017.

Estimating intraspecific genetic diversity from community DNA metabarcoding data.从群落DNA宏条形码数据估计种内遗传多样性。

PeerJ. 2018 Apr 9;6:e4644. doi: 10.7717/peerj.4644. eCollection 2018.

Evaluation of the reproducibility of amplicon sequencing with Illumina MiSeq platform.使用Illumina MiSeq平台评估扩增子测序的可重复性。

PLoS One. 2017 Apr 28;12(4):e0176716. doi: 10.1371/journal.pone.0176716. eCollection 2017.

Can DNA-Based Ecosystem Assessments Quantify Species Abundance? Testing Primer Bias and Biomass--Sequence Relationships with an Innovative Metabarcoding Protocol.基于DNA的生态系统评估能否量化物种丰度？使用创新的代谢条形码协议测试引物偏差和生物量与序列的关系。

PLoS One. 2015 Jul 8;10(7):e0130324. doi: 10.1371/journal.pone.0130324. eCollection 2015.

A metabarcoding framework for facilitated survey of endolithic phototrophs with tufA.一种利用tufA促进对石内光合生物进行调查的宏条形码框架。

BMC Ecol. 2016 Mar 10;16:8. doi: 10.1186/s12898-016-0068-x.

Disparities in second-generation DNA metabarcoding results exposed with accessible and repeatable workflows.第二代 DNA 代谢组学结果的差异通过可及且可重复的工作流程得以揭示。

Mol Ecol Resour. 2018 May;18(3):590-601. doi: 10.1111/1755-0998.12770. Epub 2018 Mar 8.

Accurate Estimation of Fungal Diversity and Abundance through Improved Lineage-Specific Primers Optimized for Illumina Amplicon Sequencing.通过改进针对Illumina扩增子测序优化的谱系特异性引物准确估计真菌多样性和丰度。

Appl Environ Microbiol. 2016 Nov 21;82(24):7217-7226. doi: 10.1128/AEM.02576-16. Print 2016 Dec 15.

The effect of low-abundance OTU filtering methods on the reliability and variability of microbial composition assessed by 16S rRNA amplicon sequencing.低丰度 OTU 过滤方法对 16S rRNA 扩增子测序评估的微生物组成的可靠性和可变性的影响。

Front Cell Infect Microbiol. 2023 Jun 12;13:1165295. doi: 10.3389/fcimb.2023.1165295. eCollection 2023.

DNA barcoding and metabarcoding of standardized samples reveal patterns of marine benthic diversity.标准化样本的DNA条形码和宏条形码揭示了海洋底栖生物多样性模式。

Proc Natl Acad Sci U S A. 2015 Feb 17;112(7):2076-81. doi: 10.1073/pnas.1424997112. Epub 2015 Feb 2.

Estimating the biodiversity of terrestrial invertebrates on a forested island using DNA barcodes and metabarcoding data.利用 DNA 条形码和代谢组学数据估算森林岛屿上的陆地无脊椎动物的生物多样性。

Ecol Appl. 2019 Jun;29(4):e01877. doi: 10.1002/eap.1877. Epub 2019 Apr 8.

引用本文的文献

Development of PCR Blocking Primers Enabling DNA Metabarcoding Analysis of Dietary Composition in Hematophagous Sea Lamprey.用于食血海七鳃鳗饮食成分DNA代谢条形码分析的PCR阻断引物的开发

Ecol Evol. 2025 Aug 20;15(8):e71999. doi: 10.1002/ece3.71999. eCollection 2025 Aug.

Brute force prey metabarcoding to explore the diets of small invertebrates.采用强力猎物代谢条形码技术探索小型无脊椎动物的食性。

Ecol Evol. 2024 May 6;14(5):e11369. doi: 10.1002/ece3.11369. eCollection 2024 May.

A comparison of two gene regions for assessing community composition of eukaryotic marine microalgae from coastal ecosystems.比较两个基因区域，用于评估沿海生态系统中真核海洋微藻的群落组成。

Sci Rep. 2024 Mar 18;14(1):6442. doi: 10.1038/s41598-024-56993-4.

A metabarcode based (species) inventory of the northern Adriatic phytoplankton.基于元条形码技术的亚得里亚海北部浮游植物（物种）清单。

Biodivers Data J. 2023 Sep 25;11:e106947. doi: 10.3897/BDJ.11.e106947. eCollection 2023.

Genetic Markers for Metabarcoding of Freshwater Microalgae: Review.淡水微藻代谢条形码的遗传标记：综述

Biology (Basel). 2023 Jul 22;12(7):1038. doi: 10.3390/biology12071038.

Signal and noise in metabarcoding data.代谢组条形码数据中的信号与噪声。

PLoS One. 2023 May 11;18(5):e0285674. doi: 10.1371/journal.pone.0285674. eCollection 2023.

Benthic invertebrates in Svalbard fjords-when metabarcoding does not outperform traditional biodiversity assessment.斯瓦尔巴德峡湾的底栖无脊椎动物——当 metabarcoding 未能优于传统生物多样性评估时。

PeerJ. 2022 Nov 17;10:e14321. doi: 10.7717/peerj.14321. eCollection 2022.

Mitochondrial cytochrome c oxidase subunit I (COI) metabarcoding of Foraminifera communities using taxon-specific primers.使用基于分类的引物对有孔虫群落进行线粒体细胞色素 c 氧化酶亚基 I（COI）代谢组学分析。

PeerJ. 2022 Sep 5;10:e13952. doi: 10.7717/peerj.13952. eCollection 2022.

The gut microbiome variability of a butterflyfish increases on severely degraded Caribbean reefs.蝴蝶鱼的肠道微生物组变异性在严重退化的加勒比海礁中增加。

Commun Biol. 2022 Jul 30;5(1):770. doi: 10.1038/s42003-022-03679-0.

Strategies for molecular authentication of herbal products: from experimental design to data analysis.草药产品分子鉴定策略：从实验设计到数据分析

Chin Med. 2022 Mar 22;17(1):38. doi: 10.1186/s13020-022-00590-y.

本文引用的文献

High-throughput monitoring of wild bee diversity and abundance via mitogenomics.通过线粒体基因组学对野生蜜蜂多样性和丰度进行高通量监测。

Methods Ecol Evol. 2015 Sep;6(9):1034-1043. doi: 10.1111/2041-210X.12416. Epub 2015 Jul 6.

A framework for inferring biological communities from environmental DNA.一种从环境DNA推断生物群落的框架。

Ecol Appl. 2016 Sep;26(6):1645-1659. doi: 10.1890/15-1733.1.

Censusing marine eukaryotic diversity in the twenty-first century.21世纪海洋真核生物多样性普查。

Philos Trans R Soc Lond B Biol Sci. 2016 Sep 5;371(1702). doi: 10.1098/rstb.2015.0331.

Preparation of Amplicon Libraries for Metabarcoding of Marine Eukaryotes Using Illumina MiSeq: The Adapter Ligation Method.使用Illumina MiSeq对海洋真核生物进行代谢条形码分析的扩增子文库制备：衔接子连接法

Methods Mol Biol. 2016;1452:209-18. doi: 10.1007/978-1-4939-3774-5_14.

Preparation of Amplicon Libraries for Metabarcoding of Marine Eukaryotes Using Illumina MiSeq: The Dual-PCR Method.使用Illumina MiSeq对海洋真核生物进行代谢条形码分析的扩增子文库制备：双重PCR方法

Methods Mol Biol. 2016;1452:197-207. doi: 10.1007/978-1-4939-3774-5_13.

Indexed PCR Primers Induce Template-Specific Bias in Large-Scale DNA Sequencing Studies.索引PCR引物在大规模DNA测序研究中引发模板特异性偏差。

PLoS One. 2016 Mar 7;11(3):e0148698. doi: 10.1371/journal.pone.0148698. eCollection 2016.

Statistical approaches to account for false-positive errors in environmental DNA samples.统计方法解决环境 DNA 样本中假阳性错误。

Mol Ecol Resour. 2016 May;16(3):673-85. doi: 10.1111/1755-0998.12486. Epub 2015 Dec 12.

High-throughput sequencing and morphology perform equally well for benthic monitoring of marine ecosystems.高通量测序和形态学在海洋生态系统底栖生物监测方面表现同样出色。

Sci Rep. 2015 Sep 10;5:13932. doi: 10.1038/srep13932.

Modeling false positive detections in species occurrence data under different study designs.在不同研究设计下对物种出现数据中的误报检测进行建模。

Ecology. 2015 Feb;96(2):332-9. doi: 10.1890/14-1507.1.

PLoS One. 2015 Jul 8;10(7):e0130324. doi: 10.1371/journal.pone.0130324. eCollection 2015.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

随机抽样导致在Illumina COI宏条形码分析中稀有真核生物操作分类单元的重现性较低。

Random sampling causes the low reproducibility of rare eukaryotic OTUs in Illumina COI metabarcoding.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献