• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估和优化常用于 DNA 代谢组学序列数据分类的软件的性能。

Evaluating and optimizing the performance of software commonly used for the taxonomic classification of DNA metabarcoding sequence data.

机构信息

Department of Entomology, The Ohio State University-Ohio Agricultural Research and Development Center, 1680 Madison Ave., Wooster, OH, 44691, USA.

Department of Infectious Diseases, The Sahlgrenska Academy, University of Gothenburg, Guldhedsgatan 10, Göteborg, SE-413 46, Sweden.

出版信息

Mol Ecol Resour. 2017 Jul;17(4):760-769. doi: 10.1111/1755-0998.12628. Epub 2016 Nov 21.

DOI:10.1111/1755-0998.12628
PMID:27797448
Abstract

The taxonomic classification of DNA sequences has become a critical component of numerous ecological research applications; however, few studies have evaluated the strengths and weaknesses of commonly used sequence classification approaches. Further, the methods and software available for sequence classification are diverse, creating an environment in which it may be difficult to determine the best course of action and the trade-offs made using different classification approaches. Here, we provide an in silico evaluation of three DNA sequence classifiers, the rdp Naïve Bayesian Classifier, rtax and utax. Further, we discuss the results, merits and limitations of both the classifiers and our method of classifier evaluation. Our methods of comparison are simple, yet robust, and will provide researchers a methodological and conceptual foundation for making such evaluations in a variety of research situations. Generally, we found a considerable trade-off between accuracy and sensitivity for the classifiers tested, indicating a need for further improvement of sequence classification tools.

摘要

DNA 序列的分类学分类已经成为许多生态研究应用的关键组成部分;然而,很少有研究评估常用序列分类方法的优缺点。此外,用于序列分类的方法和软件多种多样,这使得很难确定最佳行动方案以及使用不同分类方法所做出的权衡。在这里,我们对三种 DNA 序列分类器(rdp 朴素贝叶斯分类器、rtax 和 utax)进行了计算机模拟评估。此外,我们还讨论了分类器的结果、优点和局限性,以及我们的分类器评估方法。我们的比较方法简单但稳健,将为研究人员在各种研究情况下进行此类评估提供方法和概念基础。总的来说,我们发现测试的分类器在准确性和灵敏度之间存在相当大的权衡,这表明需要进一步改进序列分类工具。

相似文献

1
Evaluating and optimizing the performance of software commonly used for the taxonomic classification of DNA metabarcoding sequence data.评估和优化常用于 DNA 代谢组学序列数据分类的软件的性能。
Mol Ecol Resour. 2017 Jul;17(4):760-769. doi: 10.1111/1755-0998.12628. Epub 2016 Nov 21.
2
Optimizing taxonomic classification of marker-gene amplicon sequences with QIIME 2's q2-feature-classifier plugin.利用 QIIME 2 的 q2-feature-classifier 插件优化标记基因扩增子序列的分类学分类。
Microbiome. 2018 May 17;6(1):90. doi: 10.1186/s40168-018-0470-z.
3
Just keep it simple? Benchmarking the accuracy of taxonomy assignment software in metabarcoding studies.保持简单?元条形码研究中分类学分配软件准确性的基准测试。
Mol Ecol Resour. 2021 Oct;21(7):2187-2189. doi: 10.1111/1755-0998.13473. Epub 2021 Aug 3.
4
CONSTAX: a tool for improved taxonomic resolution of environmental fungal ITS sequences.CONSTAX:一种用于提高环境真菌ITS序列分类分辨率的工具。
BMC Bioinformatics. 2017 Dec 6;18(1):538. doi: 10.1186/s12859-017-1952-x.
5
A sequential naïve Bayes classifier for DNA barcodes.一种用于DNA条形码的序列朴素贝叶斯分类器。
Stat Appl Genet Mol Biol. 2014 Aug;13(4):423-34. doi: 10.1515/sagmb-2013-0025.
6
Studying long 16S rDNA sequences with ultrafast-metagenomic sequence classification using exact alignments (Kraken).使用精确比对(Kraken)通过超快速宏基因组序列分类研究长16S rDNA序列。
J Microbiol Methods. 2016 Mar;122:38-42. doi: 10.1016/j.mimet.2016.01.011. Epub 2016 Jan 23.
7
Unbiased probabilistic taxonomic classification for DNA barcoding.无偏概率分类在 DNA 条形码分类学中的应用。
Bioinformatics. 2016 Oct 1;32(19):2920-7. doi: 10.1093/bioinformatics/btw346. Epub 2016 Jun 13.
8
Metaxa2 Database Builder: enabling taxonomic identification from metagenomic or metabarcoding data using any genetic marker.Metaxa2 数据库构建器:使用任何遗传标记从宏基因组或代谢组学数据中进行分类鉴定。
Bioinformatics. 2018 Dec 1;34(23):4027-4033. doi: 10.1093/bioinformatics/bty482.
9
Bioinformatic challenges for DNA metabarcoding of plants and animals.动植物 DNA 条形码的生物信息学挑战。
Mol Ecol. 2012 Apr;21(8):1834-47. doi: 10.1111/j.1365-294X.2012.05550.x.
10
A k-mer-based barcode DNA classification methodology based on spectral representation and a neural gas network.一种基于光谱表示和神经气体网络的基于k-mer的条形码DNA分类方法。
Artif Intell Med. 2015 Jul;64(3):173-84. doi: 10.1016/j.artmed.2015.06.002. Epub 2015 Jul 4.

引用本文的文献

1
A Novel Approach for Pollen Identification and Quantification Using Hybrid Capture-Based DNA Metabarcoding.一种基于杂交捕获的DNA宏条形码技术进行花粉鉴定和定量的新方法。
Ecol Evol. 2025 Apr 23;15(4):e71311. doi: 10.1002/ece3.71311. eCollection 2025 Apr.
2
A pile of pipelines: An overview of the bioinformatics software for metabarcoding data analyses.一堆管道:代谢组学数据分析的生物信息学软件概述。
Mol Ecol Resour. 2024 Jul;24(5):e13847. doi: 10.1111/1755-0998.13847. Epub 2023 Aug 7.
3
DNA High-Throughput Sequencing for Arthropod Gut Content Analysis to Evaluate Effectiveness and Safety of Biological Control Agents.
用于节肢动物肠道内容物分析以评估生物防治剂有效性和安全性的DNA高通量测序
Neotrop Entomol. 2023 Apr;52(2):302-332. doi: 10.1007/s13744-022-01011-3. Epub 2022 Dec 7.
4
Metabarcoding of insect-associated fungal communities: a comparison of internal transcribed spacer (ITS) and large-subunit (LSU) rRNA markers.昆虫相关真菌群落的代谢条形码分析:内转录间隔区(ITS)和大亚基(LSU)rRNA标记的比较
MycoKeys. 2022 Mar 8;88:1-33. doi: 10.3897/mycokeys.88.77106. eCollection 2022.
5
Metabarcoding versus mapping unassembled shotgun reads for identification of prey consumed by arthropod epigeal predators.代谢条码测序与未组装的 shotgun 测序在鉴定节肢动物地上捕食者所食猎物中的比较。
Gigascience. 2022 Mar 24;11. doi: 10.1093/gigascience/giac020.
6
Quantification of marine benthic communities with metabarcoding.基于 metabarcoding 技术的海洋底栖生物群落定量分析。
Mol Ecol Resour. 2022 Apr;22(3):1043-1054. doi: 10.1111/1755-0998.13536. Epub 2021 Nov 1.
7
Method for quick DNA barcode reference library construction.快速构建DNA条形码参考文库的方法。
Ecol Evol. 2021 Aug 4;11(17):11627-11638. doi: 10.1002/ece3.7788. eCollection 2021 Sep.
8
Next-Generation Sequencing and Its Impacts on Entomological Research in Ecology and Evolution.下一代测序及其对生态与进化领域昆虫学研究的影响。
Neotrop Entomol. 2021 Oct;50(5):679-696. doi: 10.1007/s13744-021-00895-x. Epub 2021 Aug 10.
9
An efficient and robust laboratory workflow and tetrapod database for larger scale environmental DNA studies.用于更大规模环境 DNA 研究的高效稳健实验室工作流程和四足动物数据库。
Gigascience. 2019 Apr 1;8(4). doi: 10.1093/gigascience/giz029.
10
A reference cytochrome c oxidase subunit I database curated for hierarchical classification of arthropod metabarcoding data.一个经过整理的细胞色素c氧化酶亚基I参考数据库,用于节肢动物代谢条形码数据的层次分类。
PeerJ. 2018 Jun 26;6:e5126. doi: 10.7717/peerj.5126. eCollection 2018.