Corse Emmanuel, Tougard Christelle, Archambaud-Suard Gaït, Agnèse Jean-François, Messu Mandeng Françoise D, Bilong Bilong Charles F, Duneau David, Zinger Lucie, Chappaz Rémi, Xu Charles C Y, Meglécz Emese, Dubut Vincent
Aix Marseille Univ, Avignon Univ CNRS, IRD, IMBE Marseille France.
Agence de Recherche pour la Biodiversité à la Réunion (ARBRE) Saint-Leu, La Réunion France.
Ecol Evol. 2019 Mar 18;9(8):4603-4620. doi: 10.1002/ece3.5063. eCollection 2019 Apr.
In diet metabarcoding analyses, insufficient taxonomic coverage of PCR primer sets generates false negatives that may dramatically distort biodiversity estimates. In this paper, we investigated the taxonomic coverage and complementarity of three cytochrome oxidase subunit I gene (COI) primer sets based on in silico analyses and we conducted an in vivo evaluation using fecal and spider web samples from different invertivores, environments, and geographic locations. Our results underline the lack of predictability of both the coverage and complementarity of individual primer sets: (a) sharp discrepancies exist observed between in silico and in vivo analyses (to the detriment of in silico analyses); (b) both coverage and complementarity depend greatly on the predator and on the taxonomic level at which preys are considered; (c) primer sets' complementarity is the greatest at fine taxonomic levels (molecular operational taxonomic units [MOTUs] and variants). We then formalized the "one-locus-several-primer-sets" (OLSP) strategy, that is, the use of several primer sets that target the same locus (here the first part of the COI gene) and the same group of taxa (here invertebrates). The proximal aim of the OLSP strategy is to minimize false negatives by increasing total coverage through multiple primer sets. We illustrate that the OLSP strategy is especially relevant from this perspective since distinct variants within the same MOTUs were not equally detected across all primer sets. Furthermore, the OLSP strategy produces largely overlapping and comparable sequences, which cannot be achieved when targeting different loci. This facilitates the use of haplotypic diversity information contained within metabarcoding datasets, for example, for phylogeography and finer analyses of prey-predator interactions.
在饮食代谢条形码分析中,PCR引物组的分类覆盖不足会产生假阴性结果,这可能会极大地扭曲生物多样性估计。在本文中,我们基于计算机模拟分析研究了三种细胞色素氧化酶亚基I基因(COI)引物组的分类覆盖范围和互补性,并使用来自不同食虫动物、环境和地理位置的粪便和蜘蛛网样本进行了体内评估。我们的结果强调了单个引物组在覆盖范围和互补性方面缺乏可预测性:(a)在计算机模拟分析和体内分析之间存在明显差异(对计算机模拟分析不利);(b)覆盖范围和互补性都很大程度上取决于捕食者以及考虑猎物的分类水平;(c)引物组的互补性在精细分类水平(分子操作分类单元[MOTUs]和变体)上最大。然后,我们将“一位点多引物组”(OLSP)策略形式化,即使用针对同一基因座(此处为COI基因的第一部分)和同一类群(此处为无脊椎动物)的多个引物组。OLSP策略的近期目标是通过多个引物组增加总覆盖范围,从而将假阴性结果降至最低。我们表明,从这个角度来看,OLSP策略特别适用,因为在所有引物组中,同一MOTUs内的不同变体并未被同等检测到。此外,OLSP策略产生的序列在很大程度上是重叠且可比的,而针对不同基因座时则无法实现这一点。这便于利用代谢条形码数据集中包含的单倍型多样性信息,例如用于系统地理学以及对猎物 - 捕食者相互作用进行更精细的分析。