A comparison of algorithms for the identification of specimens using DNA barcodes: examples from gymnosperms.

Author information

Little Damon P, Stevenson Dennis Wm

Affiliation

Lewis B. and Dorothy Cullman Program for Molecular Systematic Studies, The New York Botanical Garden, Bronx, New York 10458-5126, USA.

Publication information

Cladistics. 2007 Feb;23(1):1-21. doi: 10.1111/j.1096-0031.2006.00126.x.

PMID:34905841

Abstract

To use DNA sequences for specimen identification (e.g., barcoding, fingerprinting), an algorithm to compare query sequences with a reference database is needed. Precision and accuracy of query sequence identification were estimated for hierarchical clustering (parsimony and neighbor joining), similarity methods (BLAST, BLAT and megaBLAST), combined clustering/similarity methods (BLAST/parsimony and BLAST/neighbor joining), diagnostic methods (DNA-BAR and DOME ID), and a new method (ATIM). We offer two novel alignment-free algorithmic solutions (DOME ID and ATIM) to identify query sequences for the purposes of DNA barcoding. Publicly available gymnosperm nrITS 2 and plastid matK sequences were used as test data sets. On the test data sets, almost all of the methods were able to accurately identify sequences to genus; however, no method was able to accurately identify query sequences to species at a frequency that would be considered useful for routine specimen identification (42-71% unambiguously correct). Clustering methods performed the worst (perhaps due to alignment issues). Similarity methods, ATIM, DNA-BAR, and DOME ID all performed at approximately the same level. Given the relative precision of the algorithms (median = 67% unambiguous), the low accuracy of species-level identification observed could be ascribed to the lack of correspondence between patterns of allelic similarity and species delimitations. Application of DNA barcoding to sequences of CITES-listed cycads (Cycadopsida) provides an example of the potential application of DNA barcoding to enforcement of conservation laws.
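The distinction the abstract draws between an "unambiguous" identification and an "unambiguously correct" one can be illustrated with a toy sketch. The code below is not BLAST, ATIM, DOME ID, or any other method evaluated in the paper; it is a minimal alignment-free nearest-match classifier based on Jaccard similarity of k-mer sets. The function names, the value of `k`, the sequences, and the species labels are all invented for illustration. Reading "precision" as the fraction of unambiguous answers and "accuracy" as the fraction that are also correct is an assumption about how the abstract uses those terms.

```python
def kmers(seq, k=4):
    """Return the set of overlapping k-mers in a sequence."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of two k-mer sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def identify(query, reference, k=4):
    """Return the set of species tied for the best similarity to the query.

    reference is a list of (species, sequence) pairs.
    """
    q = kmers(query, k)
    best_score, best_species = -1.0, set()
    for species, seq in reference:
        score = jaccard(q, kmers(seq, k))
        if score > best_score:
            best_score, best_species = score, {species}
        elif score == best_score:
            best_species.add(species)
    return best_species


def score_queries(queries, reference, k=4):
    """Score (true_species, sequence) queries against the reference.

    An identification counts as unambiguous when exactly one species
    has the best score, and as unambiguously correct when that single
    species is also the true one.
    """
    if not queries:
        raise ValueError("no queries to score")
    unambiguous = correct = 0
    for true_species, seq in queries:
        hits = identify(seq, reference, k)
        if len(hits) == 1:
            unambiguous += 1
            if true_species in hits:
                correct += 1
    n = len(queries)
    return {"unambiguous": unambiguous / n,
            "unambiguously_correct": correct / n}


# Hypothetical toy data: two species of one genus share an identical
# sequence, so a query matching it cannot be resolved to species.
REFERENCE = [
    ("Genus_a sp1", "ACGTACGTTGCA"),
    ("Genus_a sp2", "ACGTACGTTGCA"),
    ("Genus_b sp1", "TTGGCCAATTGG"),
]
QUERIES = [
    ("Genus_a sp1", "ACGTACGTTGCA"),
    ("Genus_b sp1", "TTGGCCAATTGC"),
]

if __name__ == "__main__":
    print(score_queries(QUERIES, REFERENCE))
```

In this toy data the first query ties between two congeneric species, so it is identifiable to genus but not to species. That mirrors the abstract's point that shared alleles across species boundaries limit species-level accuracy regardless of the matching algorithm.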

Similar articles

DNA barcoding of recently diverged species: relative performance of matching methods.

PLoS One. 2012;7(1):e30490. doi: 10.1371/journal.pone.0030490. Epub 2012 Jan 17.

Two new computational methods for universal DNA barcoding: a benchmark using barcode sequences of bacteria, archaea, animals, fungi, and land plants.

PLoS One. 2013 Oct 18;8(10):e76910. doi: 10.1371/journal.pone.0076910. eCollection 2013.

An assessment of the DNA barcodes of Indian freshwater fishes.

Gene. 2014 Mar 1;537(1):20-8. doi: 10.1016/j.gene.2013.12.047. Epub 2013 Dec 28.

Exposing the illegal trade in cycad species (Cycadophyta: Encephalartos) at two traditional medicine markets in South Africa using DNA barcoding.

Genome. 2016 Sep;59(9):771-81. doi: 10.1139/gen-2016-0032. Epub 2016 Jul 13.

[Molecular identification in genus of Lilium based on DNA barcoding].

Yao Xue Xue Bao. 2014 Dec;49(12):1730-8.

VIP Barcoding: composition vector-based software for rapid species identification based on DNA barcoding.

Mol Ecol Resour. 2014 Jul;14(4):871-81. doi: 10.1111/1755-0998.12235. Epub 2014 Mar 7.

Morphological identification and COI barcodes of adult flies help determine species identities of chironomid larvae (Diptera, Chironomidae).

Bull Entomol Res. 2016 Feb;106(1):34-46. doi: 10.1017/S0007485315000486. Epub 2015 Jun 15.

Species discrimination in Sisyrinchium (Iridaceae): assessment of DNA barcodes in a taxonomically challenging genus.

Mol Ecol Resour. 2014 Mar;14(2):324-35. doi: 10.1111/1755-0998.12182. Epub 2013 Nov 11.

FuzzyID2: A software package for large data set species identification via barcoding and metabarcoding using hidden Markov models and fuzzy set methods.

Mol Ecol Resour. 2018 May;18(3):666-675. doi: 10.1111/1755-0998.12738. Epub 2017 Dec 10.

Cited by

The DNA barcode identification of Dalbergia odorifera T. Chen and Dalbergia tonkinensis Prain.

BMC Plant Biol. 2023 Nov 7;23(1):546. doi: 10.1186/s12870-023-04513-3.

Implementation of machine learning in DNA barcoding for determining the plant family taxonomy.

Heliyon. 2023 Sep 21;9(10):e20161. doi: 10.1016/j.heliyon.2023.e20161. eCollection 2023 Oct.

Phylotranscriptomics Shed Light on Intrageneric Relationships and Historical Biogeography of (Cycadales).

Plants (Basel). 2023 Jan 19;12(3):478. doi: 10.3390/plants12030478.

What's left in the tank? Identification of non-ascribed aquarium's coral collections with DNA barcodes as part of an integrated diagnostic approach.

Conserv Genet Resour. 2022;14(2):167-182. doi: 10.1007/s12686-021-01250-3. Epub 2022 Jan 11.

Applied Barcoding: The Practicalities of DNA Testing for Herbals.

Plants (Basel). 2020 Sep 4;9(9):1150. doi: 10.3390/plants9091150.

Figures of merit and statistics for detecting faulty species identification with DNA barcodes: A case study in Ramaria and related fungal genera.

PLoS One. 2020 Aug 19;15(8):e0237507. doi: 10.1371/journal.pone.0237507. eCollection 2020.

Phylogenomic Approaches to DNA Barcoding of Herbal Medicines: Developing Clade-Specific Diagnostic Characters for .

Front Plant Sci. 2019 May 14;10:586. doi: 10.3389/fpls.2019.00586. eCollection 2019.

DNA barcoding of , the most taxonomically complicated genus of Papaveraceae.

Ecol Evol. 2019 Jan 21;9(4):1934-1945. doi: 10.1002/ece3.4886. eCollection 2019 Feb.

Comparative Biology of Cycad Pollen, Seed and Tissue - A Plant Conservation Perspective.

Bot Rev. 2018;84(3):295-314. doi: 10.1007/s12229-018-9203-z. Epub 2018 Jul 5.

The Application and Limitation of Universal Chloroplast Markers in Discriminating East Asian Evergreen Oaks.

Front Plant Sci. 2018 May 8;9:569. doi: 10.3389/fpls.2018.00569. eCollection 2018.
