MPI-blastn and NCBI-TaxCollector: improving metagenomic analysis with high performance classification and wide taxonomic attachment.

Authors

Dias R, Xavier M G, Rossi F D, Neves M V, Lange T A P, Giongo A, De Rose C A F, Triplett E W

Affiliation

Department of Microbiology and Cell Science, University of Florida, Florida, United States.

Publication

J Bioinform Comput Biol. 2014 Jun;12(3):1450013. doi: 10.1142/S0219720014500139.

DOI: 10.1142/S0219720014500139
PMID: 24969751
Abstract

Metagenomic sequencing technologies are advancing rapidly, and the volume of output data from high-throughput sequencing has increased substantially over the years. As a result, metagenomic analysis now requires advanced computational optimizations. In this paper, we describe a new parallel implementation of nucleotide BLAST (MPI-blastn) and a new tool, NCBI-TaxCollector, that attaches NCBI taxonomy to Basic Local Alignment Search Tool (BLAST) results. MPI-blastn achieved high performance compared with mpiBLAST and ScalaBLAST. In our best case, MPI-blastn ran 408 times faster on 384 cores. Our evaluations showed that NCBI-TaxCollector performs taxonomic attachment 125 times faster and needs 120 times less RAM than the previous TaxCollector. With these optimizations, a multiple sequence search that currently takes 37 hours can be performed in less than 6 min, and post-processing with NCBI taxonomic data attachment, which previously took 48 hours, now runs in 23 min.
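The figures in the abstract can be cross-checked with simple arithmetic. The sketch below is not from the paper: the function and constant names are hypothetical, and the parallel-efficiency line assumes the 408x factor was measured against a single-core baseline, which the abstract does not state.

```python
# Cross-check of the speedup figures quoted in the abstract.
# All names are illustrative; only the numbers come from the abstract.

SEARCH_BEFORE_MIN = 37 * 60   # multiple sequence search: 37 hours
SEARCH_AFTER_MIN = 6          # "less than 6 min", so this is an upper bound
POST_BEFORE_MIN = 48 * 60     # taxonomic post-processing: 48 hours
POST_AFTER_MIN = 23           # 23 min
BEST_SPEEDUP = 408            # best-case MPI-blastn speedup
CORES = 384                   # cores used in the best case


def speedup(before_min: float, after_min: float) -> float:
    """Return how many times faster the optimized run is."""
    if after_min <= 0:
        raise ValueError("after_min must be positive")
    return before_min / after_min


def parallel_efficiency(speedup_factor: float, cores: int) -> float:
    """Speedup per core; assumes a single-core baseline (not stated in the abstract)."""
    if cores <= 0:
        raise ValueError("cores must be positive")
    return speedup_factor / cores


if __name__ == "__main__":
    # 37 h in under 6 min implies a speedup of more than 370x.
    print(f"search speedup (lower bound): {speedup(SEARCH_BEFORE_MIN, SEARCH_AFTER_MIN):.1f}x")
    # 48 h in 23 min is about 125x, matching the quoted NCBI-TaxCollector figure.
    print(f"post-processing speedup: {speedup(POST_BEFORE_MIN, POST_AFTER_MIN):.1f}x")
    # A 408x speedup applied to 37 h gives about 5.4 min, consistent with "less than 6 min".
    print(f"37 h at 408x: {SEARCH_BEFORE_MIN / BEST_SPEEDUP:.2f} min")
    # A value above 1.0 would mean superlinear scaling under the single-core assumption.
    print(f"efficiency on {CORES} cores: {parallel_efficiency(BEST_SPEEDUP, CORES):.4f}")
```

Under these assumptions the quoted numbers are mutually consistent: 48 hours reduced to 23 min is roughly the 125x reported for NCBI-TaxCollector, and a 408x speedup on a 37-hour search lands under the 6 min mentioned.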

Similar articles

1
G-BLASTN: accelerating nucleotide alignment by graphics processors.
Bioinformatics. 2014 May 15;30(10):1384-91. doi: 10.1093/bioinformatics/btu047. Epub 2014 Jan 24.
2
A Massively Parallel Sequence Similarity Search for Metagenomic Sequencing Data.
Int J Mol Sci. 2017 Oct 11;18(10):2124. doi: 10.3390/ijms18102124.
3
taxMaps: comprehensive and highly accurate taxonomic classification of short-read data in reasonable time.
Genome Res. 2018 May;28(5):751-758. doi: 10.1101/gr.225276.117. Epub 2018 Mar 27.
4
Scalable metagenomics alignment research tool (SMART): a scalable, rapid, and complete search heuristic for the classification of metagenomic sequences from complex sequence populations.
BMC Bioinformatics. 2016 Jul 28;17:292. doi: 10.1186/s12859-016-1159-6.
5
Re-purposing software for functional characterization of the microbiome.
Microbiome. 2021 Jan 9;9(1):4. doi: 10.1186/s40168-020-00971-1.
6
Metagenomic assembly through the lens of validation: recent advances in assessing and improving the quality of genomes assembled from metagenomes.
Brief Bioinform. 2019 Jul 19;20(4):1140-1150. doi: 10.1093/bib/bbx098.
7
SILVA, RDP, Greengenes, NCBI and OTT - how do these taxonomies compare?
BMC Genomics. 2017 Mar 14;18(Suppl 2):114. doi: 10.1186/s12864-017-3501-4.
8
High speed BLASTN: an accelerated MegaBLAST search tool.
Nucleic Acids Res. 2015 Sep 18;43(16):7762-8. doi: 10.1093/nar/gkv784. Epub 2015 Aug 6.
9
MetaCRAM: an integrated pipeline for metagenomic taxonomy identification and compression.
BMC Bioinformatics. 2016 Feb 19;17:94. doi: 10.1186/s12859-016-0932-x.

Cited by

1
Assessment of soil bacterial communities in integrated crop production systems within the Amazon Biome, Brazil: a comparative study.
Braz J Microbiol. 2024 Sep;55(3):2815-2825. doi: 10.1007/s42770-024-01352-8. Epub 2024 May 2.
2
Competition with insectivorous ants as a contributor to low songbird diversity at low elevations in the eastern Himalaya.
Ecol Evol. 2020 Mar 30;10(10):4280-4290. doi: 10.1002/ece3.6196. eCollection 2020 May.
3
Contrasting diversity of vaginal lactobacilli among the females of Northeast India.
BMC Microbiol. 2019 Aug 27;19(1):198. doi: 10.1186/s12866-019-1568-6.
4
Genomic Targets and Features of BarA-UvrY (-SirA) Signal Transduction Systems.
PLoS One. 2015 Dec 16;10(12):e0145035. doi: 10.1371/journal.pone.0145035. eCollection 2015.
5
Different combinations of atomic interactions predict protein-small molecule and protein-DNA/RNA affinities with similar accuracy.
Proteins. 2015 Nov;83(11):2100-14. doi: 10.1002/prot.24928. Epub 2015 Sep 23.