Comparison of Current BLAST Software on Nucleotide Sequences.

Author information

Elizabeth Cha I, Rouchka Eric C

Affiliation

University of Louisville, Department of Computer Engineering and Computer Science, Louisville, KY 40292

Publication information

Proc IPDPS (Conf). 2005 Apr 4;19:8. doi: 10.1109/IPDPS.2005.145.

DOI:10.1109/IPDPS.2005.145

PMID:21243090

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC3021256/

Abstract

The computational power needed for searching exponentially growing databases, such as GenBank, has increased dramatically. Three different implementations of the most widely used sequence alignment tool, known as BLAST (Basic Local Alignment Search Tool), are studied for their efficiency on nucleotide-nucleotide comparisons. The performance of these implementations is evaluated using target databases and query sequences of varying lengths and numbers of entries constructed from human genomic and EST sequences. In general, WU BLAST was found to be most efficient when the database and query composition are unknown. NCBI BLAST appears to work best when the database contains a small number of sequences, while mpiBLAST shows the power of database distribution when the number of bases per target database is large. The optimal number of compute nodes in mpiBLAST varies depending upon the database, yet in the cases studied, remains surprisingly low.
