Suppr超能文献

美国全国广播公司最新消息:将病毒和真菌数据库添加到朴素贝叶斯分类工具中。

NBC update: The addition of viral and fungal databases to the Naïve Bayes classification tool.

作者信息

Rosen Gail L, Lim Tze Yee

机构信息

Department of Electrical and Computer Engineering, Drexel University, Philadelphia, PA, USA.

出版信息

BMC Res Notes. 2012 Jan 31;5:81. doi: 10.1186/1756-0500-5-81.

Abstract

BACKGROUND

Classifying the fungal and viral content of a sample is an important component of analyzing microbial communities in environmental media. Therefore, a method to classify any fragment from these organisms' DNA should be implemented.

RESULTS

We update the näive Bayes classification (NBC) tool to classify reads originating from viral and fungal organisms. NBC classifies a fungal dataset similarly to Basic Local Alignment Search Tool (BLAST) and the Ribosomal Database Project (RDP) classifier. We also show NBC's similarities and differences to RDP on a fungal large subunit (LSU) ribosomal DNA dataset. For viruses in the training database, strain classification accuracy is 98%, while for those reads originating from sequences not in the database, the order-level accuracy is 78%, where order indicates the taxonomic level in the tree of life.

CONCLUSIONS

In addition to being competitive to other classifiers available, NBC has the potential to handle reads originating from any location in the genome. We recommend using the Bacteria/Archaea, Fungal, and Virus databases separately due to algorithmic biases towards long genomes. The tool is publicly available at: http://nbc.ece.drexel.edu.

摘要

背景

对样本中的真菌和病毒成分进行分类是分析环境介质中微生物群落的重要组成部分。因此,应实施一种对这些生物体DNA的任何片段进行分类的方法。

结果

我们更新了朴素贝叶斯分类(NBC)工具,以对源自病毒和真菌生物体的 reads 进行分类。NBC 对真菌数据集的分类与基本局部比对搜索工具(BLAST)和核糖体数据库项目(RDP)分类器类似。我们还展示了 NBC 在真菌大亚基(LSU)核糖体 DNA 数据集上与 RDP 的异同。对于训练数据库中的病毒,菌株分类准确率为 98%,而对于那些源自数据库中不存在序列的 reads,目级准确率为 78%,其中目表示生命之树中的分类级别。

结论

除了与其他可用分类器具有竞争力外,NBC 还有潜力处理源自基因组中任何位置的 reads。由于算法对长基因组存在偏差,我们建议分别使用细菌/古菌、真菌和病毒数据库。该工具可在以下网址公开获取:http://nbc.ece.drexel.edu

相似文献

2
NBC: the Naive Bayes Classification tool webserver for taxonomic classification of metagenomic reads.
Bioinformatics. 2011 Jan 1;27(1):127-9. doi: 10.1093/bioinformatics/btq619. Epub 2010 Nov 8.
3
Comparison of statistical methods to classify environmental genomic fragments.
IEEE Trans Nanobioscience. 2010 Dec;9(4):310-6. doi: 10.1109/TNB.2010.2081375. Epub 2010 Sep 27.
4
Accurate, rapid taxonomic classification of fungal large-subunit rRNA genes.
Appl Environ Microbiol. 2012 Mar;78(5):1523-33. doi: 10.1128/AEM.06826-11. Epub 2011 Dec 22.
5
Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy.
Appl Environ Microbiol. 2007 Aug;73(16):5261-7. doi: 10.1128/AEM.00062-07. Epub 2007 Jun 22.
9
Metagenome fragment classification using N-mer frequency profiles.
Adv Bioinformatics. 2008;2008:205969. doi: 10.1155/2008/205969. Epub 2008 Nov 16.
10
Using the RDP classifier to predict taxonomic novelty and reduce the search space for finding novel organisms.
PLoS One. 2012;7(3):e32491. doi: 10.1371/journal.pone.0032491. Epub 2012 Mar 5.

引用本文的文献

1
Machine Learning and Deep Learning Applications in Metagenomic Taxonomy and Functional Annotation.
Front Microbiol. 2022 Mar 14;13:811495. doi: 10.3389/fmicb.2022.811495. eCollection 2022.
2
Comprehensive benchmarking and ensemble approaches for metagenomic classifiers.
Genome Biol. 2017 Sep 21;18(1):182. doi: 10.1186/s13059-017-1299-7.
3
Protein signature-based estimation of metagenomic abundances including all domains of life and viruses.
Bioinformatics. 2013 Apr 15;29(8):973-80. doi: 10.1093/bioinformatics/btt077. Epub 2013 Feb 15.

本文引用的文献

2
NBC: the Naive Bayes Classification tool webserver for taxonomic classification of metagenomic reads.
Bioinformatics. 2011 Jan 1;27(1):127-9. doi: 10.1093/bioinformatics/btq619. Epub 2010 Nov 8.
3
Indoor fungal composition is geographically patterned and more diverse in temperate zones than in the tropics.
Proc Natl Acad Sci U S A. 2010 Aug 3;107(31):13748-53. doi: 10.1073/pnas.1000454107. Epub 2010 Jun 28.
5
Characterization of the oral fungal microbiome (mycobiome) in healthy individuals.
PLoS Pathog. 2010 Jan 8;6(1):e1000713. doi: 10.1371/journal.ppat.1000713.
6
Metagenome fragment classification using N-mer frequency profiles.
Adv Bioinformatics. 2008;2008:205969. doi: 10.1155/2008/205969. Epub 2008 Nov 16.
7
Microbial community profiling for human microbiome projects: Tools, techniques, and challenges.
Genome Res. 2009 Jul;19(7):1141-52. doi: 10.1101/gr.085464.108. Epub 2009 Apr 21.
8
A software pipeline for processing and identification of fungal ITS sequences.
Source Code Biol Med. 2009 Jan 15;4:1. doi: 10.1186/1751-0473-4-1.
9
MetaSim: a sequencing simulator for genomics and metagenomics.
PLoS One. 2008 Oct 8;3(10):e3373. doi: 10.1371/journal.pone.0003373.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验