Suppr超能文献

水星BLASTP:加速蛋白质序列比对

Mercury BLASTP: Accelerating Protein Sequence Alignment.

作者信息

Jacob Arpith, Lancaster Joseph, Buhler Jeremy, Harris Brandon, Chamberlain Roger D

机构信息

Dept. of Computer Science and Engineering, Washington University in St. Louis.

出版信息

ACM Trans Reconfigurable Technol Syst. 2008 Jun;1(2):9. doi: 10.1145/1371579.1371581.

Abstract

Large-scale protein sequence comparison is an important but compute-intensive task in molecular biology. BLASTP is the most popular tool for comparative analysis of protein sequences. In recent years, an exponential increase in the size of protein sequence databases has required either exponentially more running time or a cluster of machines to keep pace. To address this problem, we have designed and built a high-performance FPGA-accelerated version of BLASTP, Mercury BLASTP. In this paper, we describe the architecture of the portions of the application that are accelerated in the FPGA, and we also describe the integration of these FPGA-accelerated portions with the existing BLASTP software. We have implemented Mercury BLASTP on a commodity workstation with two Xilinx Virtex-II 6000 FPGAs. We show that the new design runs 11-15 times faster than software BLASTP on a modern CPU while delivering close to 99% identical results.

摘要

大规模蛋白质序列比较是分子生物学中一项重要但计算密集型的任务。BLASTP是蛋白质序列比较分析中最常用的工具。近年来,蛋白质序列数据库规模呈指数级增长,这要么需要指数级增长的运行时间,要么需要一组机器才能跟上。为了解决这个问题,我们设计并构建了一个高性能的FPGA加速版BLASTP,即Mercury BLASTP。在本文中,我们描述了在FPGA中加速的应用程序部分的架构,还描述了这些FPGA加速部分与现有BLASTP软件的集成。我们在配备两个Xilinx Virtex-II 6000 FPGA的商用工作站上实现了Mercury BLASTP。我们表明,新设计在现代CPU上的运行速度比软件BLASTP快11至15倍,同时提供接近99%的相同结果。

相似文献

1
Mercury BLASTP: Accelerating Protein Sequence Alignment.
ACM Trans Reconfigurable Technol Syst. 2008 Jun;1(2):9. doi: 10.1145/1371579.1371581.
2
CUDA-BLASTP: accelerating BLASTP on CUDA-enabled graphics hardware.
IEEE/ACM Trans Comput Biol Bioinform. 2011 Nov-Dec;8(6):1678-84. doi: 10.1109/TCBB.2011.33.
3
BLASTP-ACC: Parallel Architecture and Hardware Accelerator Design for BLAST-Based Protein Sequence Alignment.
IEEE Trans Biomed Circuits Syst. 2019 Dec;13(6):1771-1782. doi: 10.1109/TBCAS.2019.2943539. Epub 2019 Oct 2.
5
High-performance reconfigurable hardware architecture for restricted Boltzmann machines.
IEEE Trans Neural Netw. 2010 Nov;21(11):1780-92. doi: 10.1109/TNN.2010.2073481. Epub 2010 Sep 20.
6
FHAST: FPGA-Based Acceleration of Bowtie in Hardware.
IEEE/ACM Trans Comput Biol Bioinform. 2015 Sep-Oct;12(5):973-81. doi: 10.1109/TCBB.2015.2405333.
7
Leveraging FPGAs for Accelerating Short Read Alignment.
IEEE/ACM Trans Comput Biol Bioinform. 2017 May-Jun;14(3):668-677. doi: 10.1109/TCBB.2016.2535385. Epub 2016 Feb 29.
8
cuBLASTP: Fine-Grained Parallelization of Protein Sequence Search on CPU+GPU.
IEEE/ACM Trans Comput Biol Bioinform. 2017 Jul-Aug;14(4):830-843. doi: 10.1109/TCBB.2015.2489662. Epub 2015 Oct 12.
9
Peptide mass fingerprinting using field-programmable gate arrays.
IEEE Trans Biomed Circuits Syst. 2009 Jun;3(3):142-9. doi: 10.1109/TBCAS.2008.2010945.
10
Novel intelligent real-time position tracking system using FPGA and fuzzy logic.
ISA Trans. 2014 Mar;53(2):402-14. doi: 10.1016/j.isatra.2013.09.003. Epub 2013 Oct 7.

引用本文的文献

2
Molecular Mimicry Impact of the COVID-19 Pandemic: Sequence Homology Between SARS-CoV-2 and Autoimmune Diseases Epitopes.
Immunoinformatics (Amst). 2025 Jun;18. doi: 10.1016/j.immuno.2025.100050. Epub 2025 Mar 13.
6
Genome evolution and diversity of wild and cultivated rice species.
Nat Commun. 2024 Nov 18;15(1):9994. doi: 10.1038/s41467-024-54427-3.
8
Identification of autophagy-related genes ATG18 subfamily genes in potato ( L.) and the role of gene in heat stress.
Front Plant Sci. 2024 Aug 27;15:1439972. doi: 10.3389/fpls.2024.1439972. eCollection 2024.

本文引用的文献

1
Acceleration of Ungapped Extension in Mercury BLAST.
Microprocess Microsyst. 2009 Jun 1;33(4):281-289. doi: 10.1016/j.micpro.2009.02.007.
2
Single Pass Streaming BLAST on FPGAs.
Parallel Comput. 2007 Nov;33(10-11):741-756. doi: 10.1016/j.parco.2007.09.003.
3
Biosequence Similarity Search on the Mercury System.
J VLSI Signal Process Syst Signal Image Video Technol. 2007;49(1):101-121. doi: 10.1007/s11265-007-0087-0.
4
Identifying the conserved network of cis-regulatory sites of a eukaryotic genome.
Proc Natl Acad Sci U S A. 2005 Nov 29;102(48):17400-5. doi: 10.1073/pnas.0505147102. Epub 2005 Nov 21.
5
Genome sequencing in microfabricated high-density picolitre reactors.
Nature. 2005 Sep 15;437(7057):376-80. doi: 10.1038/nature03959. Epub 2005 Jul 31.
6
BLAST: at the core of a powerful and diverse set of sequence analysis tools.
Nucleic Acids Res. 2004 Jul 1;32(Web Server issue):W20-5. doi: 10.1093/nar/gkh435.
7
High speed homology search with FPGAs.
Pac Symp Biocomput. 2002:271-82.
8
IMPALA: matching a protein sequence against a collection of PSI-BLAST-constructed position-specific score matrices.
Bioinformatics. 1999 Dec;15(12):1000-11. doi: 10.1093/bioinformatics/15.12.1000.
9
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.
Nucleic Acids Res. 1997 Sep 1;25(17):3389-402. doi: 10.1093/nar/25.17.3389.
10
Local alignment statistics.
Methods Enzymol. 1996;266:460-80. doi: 10.1016/s0076-6879(96)66029-7.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验