Stazione Zoologica Anton Dohrn, Napoli, Italy.
Orléans, France.
Mol Ecol Resour. 2021 Feb;21(2):495-510. doi: 10.1111/1755-0998.13264. Epub 2020 Oct 15.
Measuring biological diversity is a crucial but difficult undertaking, as exemplified in oaks, where complex patterns of morphological, ecological, biogeographical and genetic differentiation collide with traditional taxonomy, which measures biodiversity in number of species (or higher taxa). In this pilot study, we generated high-throughput sequencing amplicon data of the intergenic spacer of the 5S nuclear ribosomal DNA cistron (5S-IGS) in oaks, using six mock samples that differ in geographical origin, species composition and pool complexity. The potential of the marker for automated genotaxonomy applications was assessed using a reference data set of 1,770 5S-IGS cloned sequences, covering the entire taxonomic breadth and distribution range of western Eurasian Quercus, and applying similarity (BLAST) and evolutionary approaches (maximum-likelihood trees and Evolutionary Placement Algorithm). Both methods performed equally well, allowing correct identification of species in sections Ilex and Cerris in the pure and mixed samples, and of the main lineages shared by species of sect. Quercus. Application of different cut-off thresholds revealed that medium- to high-abundance (>10 or 25) sequences suffice for a clear-cut species identification of samples containing one or a few individuals. Lower thresholds identify phylogenetic correspondence with all target species in highly mixed samples (analogous to environmental bulk samples) and include rare variants pointing towards reticulation, incomplete lineage sorting, pseudogenic 5S units and in situ (natural) contamination. Our pipeline is highly promising for future assessments of intraspecific and interpopulation diversity, and of the genetic resources of natural ecosystems, which are fundamental to empowering fast and solid biodiversity conservation programmes worldwide.
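The abundance cut-offs mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: it assumes that "abundance" means the number of reads per unique sequence, that the thresholds are applied as strict lower bounds (>10, >25), and it uses made-up toy sequences. The function name `filter_by_abundance` is hypothetical.

```python
from collections import Counter


def filter_by_abundance(reads, min_count):
    """Return unique sequences seen strictly more than `min_count` times.

    Assumption: abundance = read count per unique sequence. The abstract
    does not say how abundance was computed.
    """
    counts = Counter(reads)
    return {seq: n for seq, n in counts.items() if n > min_count}


# Toy amplicon reads (invented for illustration, not real 5S-IGS data).
reads = ["ACGTAC"] * 30 + ["ACGTAA"] * 12 + ["TTGTAC"] * 3

high = filter_by_abundance(reads, 25)    # high-abundance variants only
medium = filter_by_abundance(reads, 10)  # medium- and high-abundance variants
all_variants = filter_by_abundance(reads, 0)  # no cut-off: rare variants kept
```

With a stricter cut-off only the dominant variants remain, which is the situation the abstract describes for samples containing one or a few individuals. Lowering the cut-off retains the rare variants that the abstract associates with reticulation, incomplete lineage sorting, pseudogenic units or contamination.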