• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用 MetaBin 实现宏基因组序列的快速、准确分类学赋值。

Fast and accurate taxonomic assignments of metagenomic sequences using MetaBin.

机构信息

Laboratory for MetaSystems Research, Quantitative Biology Center, RIKEN, Yokohama, Kanagawa, Japan.

出版信息

PLoS One. 2012;7(4):e34030. doi: 10.1371/journal.pone.0034030. Epub 2012 Apr 4.

DOI:10.1371/journal.pone.0034030
PMID:22496776
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3319535/
Abstract

Taxonomic assignment of sequence reads is a challenging task in metagenomic data analysis, for which the present methods mainly use either composition- or homology-based approaches. Though the homology-based methods are more sensitive and accurate, they suffer primarily due to the time needed to generate the Blast alignments. We developed the MetaBin program and web server for better homology-based taxonomic assignments using an ORF-based approach. By implementing Blat as the faster alignment method in place of Blastx, the analysis time has been reduced by severalfold. It is benchmarked using both simulated and real metagenomic datasets, and can be used for both single and paired-end sequence reads of varying lengths (≥45 bp). To our knowledge, MetaBin is the only available program that can be used for the taxonomic binning of short reads (<100 bp) with high accuracy and high sensitivity using a homology-based approach. The MetaBin web server can be used to carry out the taxonomic analysis, by either submitting reads or Blastx output. It provides several options including construction of taxonomic trees, creation of a composition chart, functional analysis using COGs, and comparative analysis of multiple metagenomic datasets. MetaBin web server and a standalone version for high-throughput analysis are available freely at http://metabin.riken.jp/.

摘要

序列读取的分类分配是宏基因组数据分析中的一项具有挑战性的任务,目前的方法主要使用基于组成或同源性的方法。虽然基于同源性的方法更敏感和准确,但它们主要由于生成 Blast 比对所需的时间而受到影响。我们开发了 MetaBin 程序和网络服务器,以便使用基于 ORF 的方法进行更好的基于同源性的分类分配。通过实现 Blat 作为更快的比对方法来替代 Blastx,分析时间已经减少了几倍。它使用模拟和真实的宏基因组数据集进行了基准测试,并且可以用于不同长度(≥45 bp)的单端和双端序列读取。据我们所知,MetaBin 是唯一可用的程序,可以使用基于同源性的方法对短读取(<100 bp)进行高精度和高灵敏度的分类。MetaBin 网络服务器可以通过提交读取或 Blastx 输出来进行分类分析。它提供了几种选项,包括构建分类树、创建组成图表、使用 COGs 进行功能分析以及比较多个宏基因组数据集。MetaBin 网络服务器和用于高通量分析的独立版本可在 http://metabin.riken.jp/ 免费获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e986/3319535/74c7e42a28a7/pone.0034030.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e986/3319535/87d31861115c/pone.0034030.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e986/3319535/74c7e42a28a7/pone.0034030.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e986/3319535/87d31861115c/pone.0034030.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e986/3319535/74c7e42a28a7/pone.0034030.g002.jpg

相似文献

1
Fast and accurate taxonomic assignments of metagenomic sequences using MetaBin.利用 MetaBin 实现宏基因组序列的快速、准确分类学赋值。
PLoS One. 2012;7(4):e34030. doi: 10.1371/journal.pone.0034030. Epub 2012 Apr 4.
2
INDUS - a composition-based approach for rapid and accurate taxonomic classification of metagenomic sequences.INDUS-一种基于组合的方法,用于快速准确地对宏基因组序列进行分类。
BMC Genomics. 2011 Nov 30;12 Suppl 3(Suppl 3):S4. doi: 10.1186/1471-2164-12-S3-S4.
3
MEGAN-LR: new algorithms allow accurate binning and easy interactive exploration of metagenomic long reads and contigs.MEGAN-LR:新算法允许对宏基因组长读段和 contigs 进行准确的分箱和轻松的交互式探索。
Biol Direct. 2018 Apr 20;13(1):6. doi: 10.1186/s13062-018-0208-7.
4
Rapid identification of high-confidence taxonomic assignments for metagenomic data.快速鉴定宏基因组数据的高可信度分类学分配。
Nucleic Acids Res. 2012 Aug;40(14):e111. doi: 10.1093/nar/gks335. Epub 2012 Apr 24.
5
SPHINX--an algorithm for taxonomic binning of metagenomic sequences.SPHINX——一种用于宏基因组序列分类-bin 划分的算法。
Bioinformatics. 2011 Jan 1;27(1):22-30. doi: 10.1093/bioinformatics/btq608. Epub 2010 Oct 28.
6
TWARIT: an extremely rapid and efficient approach for phylogenetic classification of metagenomic sequences.TWARIT:一种用于宏基因组序列系统发育分类的极快速有效的方法。
Gene. 2012 Sep 1;505(2):259-65. doi: 10.1016/j.gene.2012.06.014. Epub 2012 Jun 15.
7
Binpairs: utilization of Illumina paired-end information for improving efficiency of taxonomic binning of metagenomic sequences.双端序列对:利用Illumina双端测序信息提高宏基因组序列分类分箱的效率
PLoS One. 2014 Dec 31;9(12):e114814. doi: 10.1371/journal.pone.0114814. eCollection 2014.
8
A statistical framework for accurate taxonomic assignment of metagenomic sequencing reads.一种用于宏基因组测序reads 精确分类学分配的统计框架。
PLoS One. 2012;7(10):e46450. doi: 10.1371/journal.pone.0046450. Epub 2012 Oct 1.
9
WebMGA: a customizable web server for fast metagenomic sequence analysis.WebMGA:一个可定制的快速宏基因组序列分析网络服务器。
BMC Genomics. 2011 Sep 7;12:444. doi: 10.1186/1471-2164-12-444.
10
DiScRIBinATE: a rapid method for accurate taxonomic classification of metagenomic sequences.DiScRIBINATE:一种用于宏基因组序列准确分类的快速方法。
BMC Bioinformatics. 2010 Oct 15;11 Suppl 7(Suppl 7):S14. doi: 10.1186/1471-2105-11-S7-S14.

引用本文的文献

1
From air to insight: the evolution of airborne DNA sequencing technologies.从空中到洞察:机载DNA测序技术的演变
Microbiology (Reading). 2025 May;171(5). doi: 10.1099/mic.0.001564.
2
Application of artificial intelligence approaches to predict the metabolism of xenobiotic molecules by human gut microbiome.应用人工智能方法预测人类肠道微生物群对外源生物分子的代谢。
Front Microbiol. 2023 Dec 5;14:1254073. doi: 10.3389/fmicb.2023.1254073. eCollection 2023.
3
Bacterial and Archaeal Viruses of Himalayan Hot Springs at Manikaran Modulate Host Genomes.

本文引用的文献

1
Database resources of the National Center for Biotechnology Information.美国国立生物技术信息中心的数据库资源。
Nucleic Acids Res. 2011 Jan;39(Database issue):D38-51. doi: 10.1093/nar/gkq1172. Epub 2010 Nov 21.
2
NBC: the Naive Bayes Classification tool webserver for taxonomic classification of metagenomic reads.NBC:用于宏基因组读取分类的朴素贝叶斯分类工具网络服务器。
Bioinformatics. 2011 Jan 1;27(1):127-9. doi: 10.1093/bioinformatics/btq619. Epub 2010 Nov 8.
3
A human gut microbial gene catalogue established by metagenomic sequencing.
马尼卡兰喜马拉雅温泉中的细菌和古菌病毒对宿主基因组进行调控。
Front Microbiol. 2018 Dec 14;9:3095. doi: 10.3389/fmicb.2018.03095. eCollection 2018.
4
A clinician's guide to microbiome analysis.临床医生微生物组分析指南。
Nat Rev Gastroenterol Hepatol. 2017 Oct;14(10):585-595. doi: 10.1038/nrgastro.2017.97. Epub 2017 Aug 9.
5
BusyBee Web: metagenomic data analysis by bootstrapped supervised binning and annotation.BusyBee Web:基于自举监督分箱和注释的宏基因组数据分析。
Nucleic Acids Res. 2017 Jul 3;45(W1):W171-W179. doi: 10.1093/nar/gkx348.
6
Comprehensive strategy for the design of precision drugs and identification of genetic signature behind proneness of the disease-a pharmacogenomic approach.精准药物设计及疾病易感性背后基因特征识别的综合策略——一种药物基因组学方法
Funct Integr Genomics. 2017 Jul;17(4):375-385. doi: 10.1007/s10142-017-0559-7. Epub 2017 May 3.
7
MetaTreeMap: An Alternative Visualization Method for Displaying Metagenomic Phylogenic Trees.MetaTreeMap:一种用于展示宏基因组系统发育树的替代可视化方法。
PLoS One. 2016 Jun 23;11(6):e0158261. doi: 10.1371/journal.pone.0158261. eCollection 2016.
8
Reconstruction of Bacterial and Viral Genomes from Multiple Metagenomes.从多个宏基因组重建细菌和病毒基因组
Front Microbiol. 2016 Apr 12;7:469. doi: 10.3389/fmicb.2016.00469. eCollection 2016.
9
Evaluation of shotgun metagenomics sequence classification methods using in silico and in vitro simulated communities.使用计算机模拟和体外模拟群落评估鸟枪法宏基因组学序列分类方法
BMC Bioinformatics. 2015 Nov 4;16:363. doi: 10.1186/s12859-015-0788-5.
10
Validation of high throughput sequencing and microbial forensics applications.高通量测序及微生物法医学应用的验证
Investig Genet. 2014 Jul 30;5:9. doi: 10.1186/2041-2223-5-9. eCollection 2014.
宏基因组测序建立的人类肠道微生物基因目录。
Nature. 2010 Mar 4;464(7285):59-65. doi: 10.1038/nature08821.
4
WebCARMA: a web application for the functional and taxonomic classification of unassembled metagenomic reads.WebCARMA:一个用于未组装宏基因组读取的功能和分类学分类的网络应用程序。
BMC Bioinformatics. 2009 Dec 18;10:430. doi: 10.1186/1471-2105-10-430.
5
MetaBioME: a database to explore commercially useful enzymes in metagenomic datasets.MetaBioME:一个在宏基因组数据集中探索具有商业用途的酶的数据库。
Nucleic Acids Res. 2010 Jan;38(Database issue):D468-72. doi: 10.1093/nar/gkp1001. Epub 2009 Nov 11.
6
Phymm and PhymmBL: metagenomic phylogenetic classification with interpolated Markov models.Phymm和PhymmBL:基于插值马尔可夫模型的宏基因组系统发育分类
Nat Methods. 2009 Sep;6(9):673-6. doi: 10.1038/nmeth.1358. Epub 2009 Aug 2.
7
SOrt-ITEMS: Sequence orthology based approach for improved taxonomic estimation of metagenomic sequences.SOrt-ITEMS:基于序列直系同源性的方法,用于改进宏基因组序列的分类学估计。
Bioinformatics. 2009 Jul 15;25(14):1722-30. doi: 10.1093/bioinformatics/btp317. Epub 2009 May 13.
8
TACOA: taxonomic classification of environmental genomic fragments using a kernelized nearest neighbor approach.TACOA:使用核化最近邻方法对环境基因组片段进行分类学分类。
BMC Bioinformatics. 2009 Feb 11;10:56. doi: 10.1186/1471-2105-10-56.
9
Genome of an endosymbiont coupling N2 fixation to cellulolysis within protist cells in termite gut.一种将固氮与白蚁肠道原生生物细胞内纤维素分解相耦合的内共生体的基因组。
Science. 2008 Nov 14;322(5904):1108-9. doi: 10.1126/science.1165578.
10
MetaSim: a sequencing simulator for genomics and metagenomics.MetaSim:一款用于基因组学和宏基因组学的测序模拟器。
PLoS One. 2008 Oct 8;3(10):e3373. doi: 10.1371/journal.pone.0003373.