• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

WEVOTE:微生物序列的加权投票分类鉴定方法

WEVOTE: Weighted Voting Taxonomic Identification Method of Microbial Sequences.

作者信息

Metwally Ahmed A, Dai Yang, Finn Patricia W, Perkins David L

机构信息

Department of Bioengineering, University of Illinois at Chicago, Chicago, IL, United States of America.

Department of Medicine, University of Illinois at Chicago, Chicago, IL, United States of America.

出版信息

PLoS One. 2016 Sep 28;11(9):e0163527. doi: 10.1371/journal.pone.0163527. eCollection 2016.

DOI:10.1371/journal.pone.0163527
PMID:27683082
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5040256/
Abstract

BACKGROUND

Metagenome shotgun sequencing presents opportunities to identify organisms that may prevent or promote disease. The analysis of sample diversity is achieved by taxonomic identification of metagenomic reads followed by generating an abundance profile. Numerous tools have been developed based on different design principles. Tools achieving high precision can lack sensitivity in some applications. Conversely, tools with high sensitivity can suffer from low precision and require long computation time.

METHODS

In this paper, we present WEVOTE (WEighted VOting Taxonomic idEntification), a method that classifies metagenome shotgun sequencing DNA reads based on an ensemble of existing methods using k-mer-based, marker-based, and naive-similarity based approaches. Our evaluation on fourteen benchmarking datasets shows that WEVOTE improves the classification precision by reducing false positive annotations while preserving a high level of sensitivity.

CONCLUSIONS

WEVOTE is an efficient and automated tool that combines multiple individual taxonomic identification methods to produce more precise and sensitive microbial profiles. WEVOTE is developed primarily to identify reads generated by MetaGenome Shotgun sequencing. It is expandable and has the potential to incorporate additional tools to produce a more accurate taxonomic profile. WEVOTE was implemented using C++ and shell scripting and is available at www.github.com/aametwally/WEVOTE.

摘要

背景

宏基因组鸟枪法测序为识别可能预防或引发疾病的生物体提供了机会。通过对宏基因组读数进行分类鉴定并生成丰度图谱来实现样本多样性分析。基于不同的设计原则已开发出众多工具。在某些应用中,实现高精度的工具可能缺乏灵敏度。相反,具有高灵敏度的工具可能存在精度低且计算时间长的问题。

方法

在本文中,我们提出了WEVOTE(加权投票分类鉴定法),这是一种基于现有方法的集合,使用基于k-mer、基于标记和基于朴素相似度的方法对宏基因组鸟枪法测序DNA读数进行分类的方法。我们在14个基准数据集上的评估表明,WEVOTE通过减少假阳性注释提高了分类精度,同时保持了较高的灵敏度。

结论

WEVOTE是一种高效的自动化工具,它结合了多种个体分类鉴定方法,以生成更精确和灵敏的微生物图谱。WEVOTE主要用于识别由宏基因组鸟枪法测序产生的读数。它具有可扩展性,有潜力纳入更多工具以生成更准确的分类图谱。WEVOTE使用C++和 shell脚本实现,可在www.github.com/aametwally/WEVOTE获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e630/5040256/eb6c48452644/pone.0163527.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e630/5040256/b6ffdc9abd9c/pone.0163527.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e630/5040256/62ce9689d749/pone.0163527.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e630/5040256/e120effb28a8/pone.0163527.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e630/5040256/86f46833a18c/pone.0163527.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e630/5040256/16646e0f8041/pone.0163527.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e630/5040256/eb6c48452644/pone.0163527.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e630/5040256/b6ffdc9abd9c/pone.0163527.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e630/5040256/62ce9689d749/pone.0163527.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e630/5040256/e120effb28a8/pone.0163527.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e630/5040256/86f46833a18c/pone.0163527.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e630/5040256/16646e0f8041/pone.0163527.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e630/5040256/eb6c48452644/pone.0163527.g006.jpg

相似文献

1
WEVOTE: Weighted Voting Taxonomic Identification Method of Microbial Sequences.WEVOTE:微生物序列的加权投票分类鉴定方法
PLoS One. 2016 Sep 28;11(9):e0163527. doi: 10.1371/journal.pone.0163527. eCollection 2016.
2
Evaluation of taxonomic classification and profiling methods for long-read shotgun metagenomic sequencing datasets.评价长读 shotgun 宏基因组测序数据集的分类和分析方法。
BMC Bioinformatics. 2022 Dec 13;23(1):541. doi: 10.1186/s12859-022-05103-0.
3
Comprehensive benchmarking and ensemble approaches for metagenomic classifiers.元基因组分类器的综合基准测试和集成方法。
Genome Biol. 2017 Sep 21;18(1):182. doi: 10.1186/s13059-017-1299-7.
4
CONSULT-II: accurate taxonomic identification and profiling using locality-sensitive hashing.CONSULT-II:基于位置敏感哈希的准确分类鉴定和特征分析。
Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae150.
5
Selection of marker genes for genetic barcoding of microorganisms and binning of metagenomic reads by Barcoder software tools.微生物遗传条形码标记基因的选择和 Barcoder 软件工具对宏基因组读段的分类。
BMC Bioinformatics. 2018 Aug 30;19(1):309. doi: 10.1186/s12859-018-2320-1.
6
Deep learning models for bacteria taxonomic classification of metagenomic data.基于深度学习的宏基因组数据细菌分类学分类模型
BMC Bioinformatics. 2018 Jul 9;19(Suppl 7):198. doi: 10.1186/s12859-018-2182-6.
7
Kelpie: generating full-length 'amplicons' from whole-metagenome datasets.凯尔皮:从全宏基因组数据集中生成全长“扩增子”。
PeerJ. 2019 Jan 30;6:e6174. doi: 10.7717/peerj.6174. eCollection 2019.
8
TAMPA: interpretable analysis and visualization of metagenomics-based taxon abundance profiles.坦帕:基于宏基因组的分类丰度图谱的可解释分析和可视化。
Gigascience. 2022 Dec 28;12. doi: 10.1093/gigascience/giad008. Epub 2023 Feb 28.
9
MetaPalette: a -mer Painting Approach for Metagenomic Taxonomic Profiling and Quantification of Novel Strain Variation.MetaPalette:用于宏基因组分类分析和新型菌株变异定量的α- mer绘画方法。
mSystems. 2016 Jun 7;1(3). doi: 10.1128/mSystems.00020-16. eCollection 2016 May-Jun.
10
PanFP: pangenome-based functional profiles for microbial communities.PanFP:基于全基因组的微生物群落功能概况
BMC Res Notes. 2015 Sep 26;8:479. doi: 10.1186/s13104-015-1462-8.

引用本文的文献

1
Challenges and insights in the exploration of the low abundance human ocular surface microbiome.探索低丰度人类眼表微生物组所面临的挑战和启示。
Front Cell Infect Microbiol. 2023 Sep 1;13:1232147. doi: 10.3389/fcimb.2023.1232147. eCollection 2023.
2
MetaPro: a scalable and reproducible data processing and analysis pipeline for metatranscriptomic investigation of microbial communities.MetaPro:用于微生物群落宏转录组学研究的可扩展且可重复的数据处理和分析管道。
Microbiome. 2023 Jun 27;11(1):143. doi: 10.1186/s40168-023-01562-6.
3
POSMM: an efficient alignment-free metagenomic profiler that complements alignment-based profiling.

本文引用的文献

1
Fast and sensitive taxonomic classification for metagenomics with Kaiju.使用Kaiju对宏基因组学进行快速且灵敏的分类学分类。
Nat Commun. 2016 Apr 13;7:11257. doi: 10.1038/ncomms11257.
2
An evaluation of the accuracy and speed of metagenome analysis tools.宏基因组分析工具的准确性和速度评估。
Sci Rep. 2016 Jan 18;6:19233. doi: 10.1038/srep19233.
3
MetaPhlAn2 for enhanced metagenomic taxonomic profiling.用于增强宏基因组分类分析的MetaPhlAn2
POSMM:一种高效的无比对宏基因组分析工具,可补充基于比对的分析。
Environ Microbiome. 2023 Mar 8;18(1):16. doi: 10.1186/s40793-023-00476-y.
4
Drastic reduction of false positive species in samples of insects by intersecting the default output of two popular metagenomic classifiers.通过交叉比较两种流行的宏基因组分类器的默认输出,大幅减少昆虫样本中的假阳性物种。
PLoS One. 2022 Oct 25;17(10):e0275790. doi: 10.1371/journal.pone.0275790. eCollection 2022.
5
Robust identification of temporal biomarkers in longitudinal omics studies.纵向组学研究中时间生物标志物的稳健识别。
Bioinformatics. 2022 Aug 2;38(15):3802-3811. doi: 10.1093/bioinformatics/btac403.
6
Music of metagenomics-a review of its applications, analysis pipeline, and associated tools.宏基因组学音乐——应用、分析流程及其相关工具的综述。
Funct Integr Genomics. 2022 Feb;22(1):3-26. doi: 10.1007/s10142-021-00810-y. Epub 2021 Oct 18.
7
Pediatric lung transplantation: Dynamics of the microbiome and bronchiolitis obliterans in cystic fibrosis.小儿肺移植:囊性纤维化中微生物组和闭塞性细支气管炎的动态变化。
J Heart Lung Transplant. 2020 Aug;39(8):824-834. doi: 10.1016/j.healun.2020.04.016. Epub 2020 May 19.
8
The olfactory G protein-coupled receptor (Olfr-78/OR51E2) modulates the intestinal response to colitis.嗅觉 G 蛋白偶联受体(Olfr-78/OR51E2)调节肠道对结肠炎的反应。
Am J Physiol Cell Physiol. 2020 Mar 1;318(3):C502-C513. doi: 10.1152/ajpcell.00454.2019. Epub 2020 Jan 8.
9
Serotonin Transporter Deficiency is Associated with Dysbiosis and Changes in Metabolic Function of the Mouse Intestinal Microbiome.血清素转运体缺乏与肠道微生物组的失调和代谢功能变化有关。
Sci Rep. 2019 Feb 14;9(1):2138. doi: 10.1038/s41598-019-38489-8.
10
Utilizing longitudinal microbiome taxonomic profiles to predict food allergy via Long Short-Term Memory networks.利用纵向微生物组分类特征通过长短时记忆网络预测食物过敏。
PLoS Comput Biol. 2019 Feb 4;15(2):e1006693. doi: 10.1371/journal.pcbi.1006693. eCollection 2019 Feb.
Nat Methods. 2015 Oct;12(10):902-3. doi: 10.1038/nmeth.3589.
4
CLARK: fast and accurate classification of metagenomic and genomic sequences using discriminative k-mers.克拉克:使用判别性k-mer对宏基因组和基因组序列进行快速准确分类
BMC Genomics. 2015 Mar 25;16(1):236. doi: 10.1186/s12864-015-1419-2.
5
TIPP: taxonomic identification and phylogenetic profiling.TIPP:分类鉴定和系统发育分析。
Bioinformatics. 2014 Dec 15;30(24):3548-55. doi: 10.1093/bioinformatics/btu721. Epub 2014 Oct 29.
6
Kraken: ultrafast metagenomic sequence classification using exact alignments.克拉肯:使用精确比对的超快速宏基因组序列分类
Genome Biol. 2014 Mar 3;15(3):R46. doi: 10.1186/gb-2014-15-3-r46.
7
Metagenomic species profiling using universal phylogenetic marker genes.基于通用系统发育标记基因的宏基因组物种分析。
Nat Methods. 2013 Dec;10(12):1196-9. doi: 10.1038/nmeth.2693. Epub 2013 Oct 20.
8
Scalable metagenomic taxonomy classification using a reference genome database.基于参考基因组数据库的可扩展宏基因组分类学分类。
Bioinformatics. 2013 Sep 15;29(18):2253-60. doi: 10.1093/bioinformatics/btt389. Epub 2013 Jul 4.
9
Metagenomic microbial community profiling using unique clade-specific marker genes.基于独特进化枝特异性标记基因的宏基因组微生物群落分析。
Nat Methods. 2012 Jun 10;9(8):811-4. doi: 10.1038/nmeth.2066.
10
SATe-II: very fast and accurate simultaneous estimation of multiple sequence alignments and phylogenetic trees.SATe-II:一种非常快速且准确的同时估计多个序列比对和系统发育树的方法。
Syst Biol. 2012 Jan;61(1):90-106. doi: 10.1093/sysbio/syr095. Epub 2011 Dec 1.