Liu Yanlei, Chen Kai, Wang Lihu, Yu Xinqiang, Xu Chao, Suo Zhili, Zhou Shiliang, Shi Shuo, Dong Wenpan
School of Landscape and Ecological Engineering, Hebei University of Engineering, Handan 056038, China.
State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing 100093, China.
Plant Divers. 2024 Oct 16;47(1):115-126. doi: 10.1016/j.pld.2024.10.002. eCollection 2025 Jan.
DNA barcoding has been extensively used for species identification. However, species identification of mixed samples or degraded DNA is limited by current DNA barcoding methods. In this study, we use plant species in Juglandaceae to evaluate an assembly-free reads accurate identification (AFRAID) method of species identification, a novel approach for precise species identification in plants. Specifically, we determined (1) the accuracy of DNA barcoding approaches in delimiting species in Juglandaceae, (2) the minimum size of chloroplast dataset for species discrimination, and (3) minimum amount of next generation sequencing (NGS) data required for species identification. We found that species identification rates were highest when whole chloroplast genomes were used, followed by taxon-specific DNA barcodes, and then universal DNA barcodes. Species identification of 100% was achieved when chloroplast genome sequence coverage reached 20% and the original sequencing data reached 500,000 reads. AFRAID accurately identified species for all samples tested after 500,000 clean reads, with far less computing time than common approaches. These results provide a new approach to accurately identify species, overcoming limitations of traditional DNA barcodes. Our method, which uses next generation sequencing to generate partial chloroplast genomes, reveals that DNA barcode regions are not necessarily fixed, accelerating the process of species identification.
DNA条形码已被广泛用于物种鉴定。然而,混合样本或降解DNA的物种鉴定受到当前DNA条形码方法的限制。在本研究中,我们使用胡桃科植物物种来评估一种免组装 reads 精确鉴定(AFRAID)的物种鉴定方法,这是一种用于植物精确物种鉴定的新方法。具体而言,我们确定了:(1)DNA条形码方法在界定胡桃科物种中的准确性;(2)用于物种区分的叶绿体数据集的最小规模;以及(3)物种鉴定所需的下一代测序(NGS)数据的最小量。我们发现,使用整个叶绿体基因组时物种鉴定率最高,其次是特定分类群的DNA条形码,然后是通用DNA条形码。当叶绿体基因组序列覆盖率达到20%且原始测序数据达到500,000条 reads 时,物种鉴定成功率达到100%。在500,000条clean reads之后,AFRAID能准确鉴定所有测试样本的物种,且计算时间远少于常用方法。这些结果提供了一种准确鉴定物种的新方法,克服了传统DNA条形码的局限性。我们的方法利用下一代测序生成部分叶绿体基因组,揭示了DNA条形码区域不一定是固定的,从而加快了物种鉴定的进程。