• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SPHINX——一种用于宏基因组序列分类-bin 划分的算法。

SPHINX--an algorithm for taxonomic binning of metagenomic sequences.

机构信息

Bio-sciences R&D Division, TCS Innovation Labs, Tata Consultancy Services Limited, Madhapur, Hyderabad, Andhra Pradesh, India.

出版信息

Bioinformatics. 2011 Jan 1;27(1):22-30. doi: 10.1093/bioinformatics/btq608. Epub 2010 Oct 28.

DOI:10.1093/bioinformatics/btq608
PMID:21030462
Abstract

MOTIVATION

Compared with composition-based binning algorithms, the binning accuracy and specificity of alignment-based binning algorithms is significantly higher. However, being alignment-based, the latter class of algorithms require enormous amount of time and computing resources for binning huge metagenomic datasets. The motivation was to develop a binning approach that can analyze metagenomic datasets as rapidly as composition-based approaches, but nevertheless has the accuracy and specificity of alignment-based algorithms. This article describes a hybrid binning approach (SPHINX) that achieves high binning efficiency by utilizing the principles of both 'composition'- and 'alignment'-based binning algorithms.

RESULTS

Validation results with simulated sequence datasets indicate that SPHINX is able to analyze metagenomic sequences as rapidly as composition-based algorithms. Furthermore, the binning efficiency (in terms of accuracy and specificity of assignments) of SPHINX is observed to be comparable with results obtained using alignment-based algorithms.

AVAILABILITY

A web server for the SPHINX algorithm is available at http://metagenomics.atc.tcs.com/SPHINX/.

摘要

动机

与基于组合的分箱算法相比,基于比对的分箱算法的分箱准确性和特异性显著更高。然而,由于基于比对,后者类算法需要大量的时间和计算资源来对庞大的宏基因组数据集进行分箱。动机是开发一种分箱方法,该方法能够像基于组合的方法一样快速地分析宏基因组数据集,但仍然具有基于比对算法的准确性和特异性。本文描述了一种混合分箱方法(SPHINX),该方法通过利用基于“组合”和“比对”的分箱算法的原理来实现高效的分箱。

结果

使用模拟序列数据集的验证结果表明,SPHINX 能够像基于组合的算法一样快速地分析宏基因组序列。此外,观察到 SPHINX 的分箱效率(在分配的准确性和特异性方面)与使用基于比对的算法获得的结果相当。

可用性

SPHINX 算法的 Web 服务器可在 http://metagenomics.atc.tcs.com/SPHINX/ 上获得。

相似文献

1
SPHINX--an algorithm for taxonomic binning of metagenomic sequences.SPHINX——一种用于宏基因组序列分类-bin 划分的算法。
Bioinformatics. 2011 Jan 1;27(1):22-30. doi: 10.1093/bioinformatics/btq608. Epub 2010 Oct 28.
2
TWARIT: an extremely rapid and efficient approach for phylogenetic classification of metagenomic sequences.TWARIT:一种用于宏基因组序列系统发育分类的极快速有效的方法。
Gene. 2012 Sep 1;505(2):259-65. doi: 10.1016/j.gene.2012.06.014. Epub 2012 Jun 15.
3
A robust and accurate binning algorithm for metagenomic sequences with arbitrary species abundance ratio.一种具有任意物种丰度比的宏基因组序列的健壮且准确的分箱算法。
Bioinformatics. 2011 Jun 1;27(11):1489-95. doi: 10.1093/bioinformatics/btr186. Epub 2011 Apr 14.
4
Binpairs: utilization of Illumina paired-end information for improving efficiency of taxonomic binning of metagenomic sequences.双端序列对:利用Illumina双端测序信息提高宏基因组序列分类分箱的效率
PLoS One. 2014 Dec 31;9(12):e114814. doi: 10.1371/journal.pone.0114814. eCollection 2014.
5
INDUS - a composition-based approach for rapid and accurate taxonomic classification of metagenomic sequences.INDUS-一种基于组合的方法,用于快速准确地对宏基因组序列进行分类。
BMC Genomics. 2011 Nov 30;12 Suppl 3(Suppl 3):S4. doi: 10.1186/1471-2164-12-S3-S4.
6
SOrt-ITEMS: Sequence orthology based approach for improved taxonomic estimation of metagenomic sequences.SOrt-ITEMS:基于序列直系同源性的方法,用于改进宏基因组序列的分类学估计。
Bioinformatics. 2009 Jul 15;25(14):1722-30. doi: 10.1093/bioinformatics/btp317. Epub 2009 May 13.
7
CoMet: a workflow using contig coverage and composition for binning a metagenomic sample with high precision.CoMet:一种使用 contig 覆盖度和组成进行宏基因组样本高精度分箱的工作流程。
BMC Bioinformatics. 2017 Dec 28;18(Suppl 16):571. doi: 10.1186/s12859-017-1967-3.
8
DiScRIBinATE: a rapid method for accurate taxonomic classification of metagenomic sequences.DiScRIBINATE:一种用于宏基因组序列准确分类的快速方法。
BMC Bioinformatics. 2010 Oct 15;11 Suppl 7(Suppl 7):S14. doi: 10.1186/1471-2105-11-S7-S14.
9
CS-SCORE: Rapid identification and removal of human genome contaminants from metagenomic datasets.CS-SCORE:从宏基因组数据集中快速识别和去除人类基因组污染物
Genomics. 2015 Aug;106(2):116-21. doi: 10.1016/j.ygeno.2015.04.005. Epub 2015 May 2.
10
MBMC: An Effective Markov Chain Approach for Binning Metagenomic Reads from Environmental Shotgun Sequencing Projects.MBMC:一种用于对环境鸟枪法测序项目中的宏基因组读数进行分箱的有效马尔可夫链方法。
OMICS. 2016 Aug;20(8):470-9. doi: 10.1089/omi.2016.0081. Epub 2016 Jul 22.

引用本文的文献

1
Metagenomic approaches in microbial ecology: an update on whole-genome and marker gene sequencing analyses.微生物生态学中的宏基因组学方法:全基因组和标记基因测序分析的最新进展。
Microb Genom. 2020 Aug;6(8). doi: 10.1099/mgen.0.000409. Epub 2020 Jul 24.
2
An Integrated Multi-Disciplinary Perspectivefor Addressing Challenges of the Human Gut Microbiome.一种应对人类肠道微生物群挑战的综合多学科视角
Metabolites. 2020 Mar 6;10(3):94. doi: 10.3390/metabo10030094.
3
Conducting metagenomic studies in microbiology and clinical research.在微生物学和临床研究中进行宏基因组学研究。
Appl Microbiol Biotechnol. 2018 Oct;102(20):8629-8646. doi: 10.1007/s00253-018-9209-9. Epub 2018 Aug 4.
4
Spherical: an iterative workflow for assembling metagenomic datasets.球形:组装宏基因组数据集的迭代工作流程。
BMC Bioinformatics. 2018 Jan 24;19(1):20. doi: 10.1186/s12859-018-2028-2.
5
A clinician's guide to microbiome analysis.临床医生微生物组分析指南。
Nat Rev Gastroenterol Hepatol. 2017 Oct;14(10):585-595. doi: 10.1038/nrgastro.2017.97. Epub 2017 Aug 9.
6
Interpreting Microbial Biosynthesis in the Genomic Age: Biological and Practical Considerations.基因组时代微生物生物合成的解读:生物学与实际考量
Mar Drugs. 2017 Jun 6;15(6):165. doi: 10.3390/md15060165.
7
BusyBee Web: metagenomic data analysis by bootstrapped supervised binning and annotation.BusyBee Web:基于自举监督分箱和注释的宏基因组数据分析。
Nucleic Acids Res. 2017 Jul 3;45(W1):W171-W179. doi: 10.1093/nar/gkx348.
8
MBMC: An Effective Markov Chain Approach for Binning Metagenomic Reads from Environmental Shotgun Sequencing Projects.MBMC:一种用于对环境鸟枪法测序项目中的宏基因组读数进行分箱的有效马尔可夫链方法。
OMICS. 2016 Aug;20(8):470-9. doi: 10.1089/omi.2016.0081. Epub 2016 Jul 22.
9
Inferring Intra-Community Microbial Interaction Patterns from Metagenomic Datasets Using Associative Rule Mining Techniques.使用关联规则挖掘技术从宏基因组数据集中推断群落内微生物相互作用模式
PLoS One. 2016 Apr 28;11(4):e0154493. doi: 10.1371/journal.pone.0154493. eCollection 2016.
10
Vikodak--A Modular Framework for Inferring Functional Potential of Microbial Communities from 16S Metagenomic Datasets.Vikodak——一个用于从16S宏基因组数据集中推断微生物群落功能潜力的模块化框架。
PLoS One. 2016 Feb 5;11(2):e0148347. doi: 10.1371/journal.pone.0148347. eCollection 2016.