DNA barcode sequence identification incorporating taxonomic hierarchy and within taxon variability.

Affiliation

Lewis B. and Dorothy Cullman Program for Molecular Systematics, The New York Botanical Garden, Bronx, New York, United States of America.

Publication information

PLoS One. 2011;6(8):e20552. doi: 10.1371/journal.pone.0020552. Epub 2011 Aug 16.

DOI: 10.1371/journal.pone.0020552

PMID: 21857897

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3156709/

Abstract

For DNA barcoding to succeed as a scientific endeavor, an accurate and expeditious method for identifying query sequences is needed. Although a global multiple-sequence alignment can be generated for some barcoding markers (e.g., COI, rbcL), not all barcoding markers are as structurally conserved (e.g., matK). Thus, algorithms that depend on global multiple-sequence alignments are not universally applicable. Some sequence identification methods that use local pairwise alignments (e.g., BLAST) are unable to accurately differentiate between highly similar sequences and are not designed to cope with hierarchic phylogenetic relationships or within-taxon variability. Here, I present a novel alignment-free sequence identification algorithm, BRONX, that accounts for observed within-taxon variability and hierarchic relationships among taxa. BRONX identifies short variable segments and their corresponding invariant flanking regions in reference sequences. These flanking regions are then used to locate and score variable regions in the query sequence without producing a global multiple-sequence alignment. By incorporating observed within-taxon variability into the scoring procedure, misidentifications arising from shared alleles/haplotypes are minimized. An explicit treatment of more inclusive terminals allows separate identifications to be made for each taxonomic level and/or for user-defined terminals. BRONX performs better than all other methods when there is imperfect overlap between query and reference sequences (e.g., mini-barcode queries against a full-length barcode database). BRONX consistently produced better identifications at the genus level for all query types.
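The flank-anchored scoring idea described in the abstract can be illustrated with a small Python sketch. This is not the BRONX implementation: the function names (`extract_segment`, `build_reference`, `score_query`), the toy sequences, the hand-picked flank pairs, and the simple match-count score are all assumptions made for illustration. In particular, the sketch takes the flanks as given, whereas the abstract says BRONX identifies them from the reference sequences. It only shows how invariant flanks can locate a variable segment without an alignment, how several observed variants per taxon can be stored, and how the same data can be scored at species or genus level.

```python
from collections import defaultdict


def extract_segment(seq, left, right):
    """Return the text between the first `left` flank and the next `right` flank,
    or None if either flank is absent (e.g. the query does not cover the region)."""
    i = seq.find(left)
    if i == -1:
        return None
    start = i + len(left)
    j = seq.find(right, start)
    if j == -1:
        return None
    return seq[start:j]


def build_reference(refs, flanks, terminal=lambda label: label):
    """Collect, per terminal and per flank pair, every variant observed in the references.

    refs     -- list of (label, sequence) pairs
    flanks   -- list of (left, right) invariant flank pairs (hypothetical, hand-picked)
    terminal -- maps a label to the terminal to identify (species by default)
    """
    table = defaultdict(lambda: [set() for _ in flanks])
    for label, seq in refs:
        for k, (left, right) in enumerate(flanks):
            seg = extract_segment(seq, left, right)
            if seg is not None:
                table[terminal(label)][k].add(seg)
    return dict(table)


def score_query(query, table, flanks):
    """Return {terminal: (matches, compared)} for the regions found in the query."""
    segments = [extract_segment(query, left, right) for left, right in flanks]
    scores = {}
    for name, variants in table.items():
        compared = sum(seg is not None for seg in segments)
        matches = sum(
            seg is not None and seg in variants[k] for k, seg in enumerate(segments)
        )
        scores[name] = (matches, compared)
    return scores


# Toy data: two variable regions, each bounded by invariant flanks.
flanks = [("ACGT", "TTGA"), ("GGCC", "AATT")]
refs = [
    ("Genus_a sp1", "ACGTAATTGACGGCCGAATT"),
    ("Genus_a sp1", "ACGTACTTGACGGCCGAATT"),  # second haplotype of the same species
    ("Genus_b sp2", "ACGTGGTTGACGGCCTAATT"),
]

# A "mini-barcode" query that covers only the first variable region.
query = "ACGTACTTGA"

species_table = build_reference(refs, flanks)
genus_table = build_reference(refs, flanks, terminal=lambda label: label.split()[0])

species_scores = score_query(query, species_table, flanks)
genus_scores = score_query(query, genus_table, flanks)
print(species_scores)
print(genus_scores)
```

Because only regions actually located in the query are compared, the short query is scored on the one region it covers rather than being penalized for the missing one, which mirrors the imperfect-overlap case mentioned in the abstract.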

Figure (pone.0020552.g001): https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be39/3156709/281b8bc8fe1a/pone.0020552.g001.jpg
