基于现场可编程门阵列的单通道流式BLAST

Single Pass Streaming BLAST on FPGAs.

作者信息

Herbordt Martin C, Model Josh, Sukhwani Bharat, Gu Yongfeng, Vancourt Tom

机构信息

Department of Electrical and Computer Engineering, Boston University; Boston, MA 02215, Web: http://www.bu.edu/caadlab .

出版信息

Parallel Comput. 2007 Nov;33(10-11):741-756. doi: 10.1016/j.parco.2007.09.003.

DOI:10.1016/j.parco.2007.09.003

PMID:19081828

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2598392/

Abstract

Approximate string matching is fundamental to bioinformatics and has been the subject of numerous FPGA acceleration studies. We address issues with respect to FPGA implementations of both BLAST- and dynamic-programming- (DP) based methods. Our primary contribution is a new algorithm for emulating the seeding and extension phases of BLAST. This operates in a single pass through a database at streaming rate, and with no preprocessing other than loading the query string. Moreover, it emulates parameters turned to maximum possible sensitivity with no slowdown. While current DP-based methods also operate at streaming rate, generating results can be cumbersome. We address this with a new structure for data extraction. We present results from several implementations showing order of magnitude acceleration over serial reference code. A simple extension assures compatibility with NCBI BLAST.

摘要

近似字符串匹配是生物信息学的基础，并且一直是众多FPGA加速研究的主题。我们解决了基于BLAST和动态规划（DP）方法的FPGA实现方面的问题。我们的主要贡献是一种用于模拟BLAST的种子生成和扩展阶段的新算法。该算法以流速率单次遍历数据库，除了加载查询字符串外无需进行预处理。此外，它在不降低速度的情况下模拟了转向最大可能灵敏度的参数。虽然当前基于DP的方法也以流速率运行，但生成结果可能很麻烦。我们通过一种新的数据提取结构来解决这个问题。我们展示了几种实现的结果，表明与串行参考代码相比有数量级的加速。一个简单的扩展确保了与NCBI BLAST的兼容性。

相似文献

Single Pass Streaming BLAST on FPGAs.基于现场可编程门阵列的单通道流式BLAST

Parallel Comput. 2007 Nov;33(10-11):741-756. doi: 10.1016/j.parco.2007.09.003.

Families of FPGA-Based Accelerators for Approximate String Matching.用于近似字符串匹配的基于现场可编程门阵列（FPGA）的加速器系列

Microprocess Microsyst. 2007 Mar 5;31(2):135-145. doi: 10.1016/j.micpro.2006.04.001.

Implementation of a motion estimation algorithm for Intel FPGAs using OpenCL.使用OpenCL为英特尔FPGA实现一种运动估计算法。

J Supercomput. 2023;79(9):9866-9888. doi: 10.1007/s11227-023-05051-3. Epub 2023 Jan 21.

Acceleration of Ungapped Extension in Mercury BLAST.水星BLAST中无间隙延伸的加速

Microprocess Microsyst. 2009 Jun 1;33(4):281-289. doi: 10.1016/j.micpro.2009.02.007.

muBLASTP: database-indexed protein sequence search on multicore CPUs.muBLASTP：基于多核CPU的数据库索引蛋白质序列搜索。

BMC Bioinformatics. 2016 Nov 4;17(1):443. doi: 10.1186/s12859-016-1302-4.

Accelerating string set matching in FPGA hardware for bioinformatics research.在FPGA硬件中加速用于生物信息学研究的字符串集匹配

BMC Bioinformatics. 2008 Apr 15;9:197. doi: 10.1186/1471-2105-9-197.

Hardware-Algorithm Codesign for Fast and Energy Efficient Approximate String Matching on FPGA for Computational Biology.硬件-算法协同设计用于在 FPGA 上进行快速和节能的近似字符串匹配，用于计算生物学。

Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:87-90. doi: 10.1109/EMBC48229.2022.9870924.

Distributed large-scale graph processing on FPGAs.基于现场可编程门阵列（FPGA）的分布式大规模图形处理

J Big Data. 2023;10(1):95. doi: 10.1186/s40537-023-00756-x. Epub 2023 Jun 4.

BLASTP-ACC: Parallel Architecture and Hardware Accelerator Design for BLAST-Based Protein Sequence Alignment.BLASTP-ACC：基于 BLAST 的蛋白质序列比对的并行架构和硬件加速器设计。

IEEE Trans Biomed Circuits Syst. 2019 Dec;13(6):1771-1782. doi: 10.1109/TBCAS.2019.2943539. Epub 2019 Oct 2.

High speed homology search with FPGAs.使用现场可编程门阵列进行高速同源性搜索。

Pac Symp Biocomput. 2002:271-82.

引用本文的文献

High speed BLASTN: an accelerated MegaBLAST search tool.高速BLASTN：一种加速的MegaBLAST搜索工具。

Nucleic Acids Res. 2015 Sep 18;43(16):7762-8. doi: 10.1093/nar/gkv784. Epub 2015 Aug 6.

A Systolic Array-Based FPGA Parallel Architecture for the BLAST Algorithm.一种基于 systolic 阵列的用于 BLAST 算法的 FPGA 并行架构。

ISRN Bioinform. 2012 Sep 4;2012:195658. doi: 10.5402/2012/195658. eCollection 2012.

Mercury BLASTP: Accelerating Protein Sequence Alignment.水星BLASTP：加速蛋白质序列比对

ACM Trans Reconfigurable Technol Syst. 2008 Jun;1(2):9. doi: 10.1145/1371579.1371581.

本文引用的文献

Families of FPGA-Based Accelerators for Approximate String Matching.用于近似字符串匹配的基于现场可编程门阵列（FPGA）的加速器系列

Microprocess Microsyst. 2007 Mar 5;31(2):135-145. doi: 10.1016/j.micpro.2006.04.001.

J VLSI Signal Process Syst Signal Image Video Technol. 2007;49(1):101-121. doi: 10.1007/s11265-007-0087-0.

Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.空位BLAST和位置特异性迭代BLAST：新一代蛋白质数据库搜索程序。

Nucleic Acids Res. 1997 Sep 1;25(17):3389-402. doi: 10.1093/nar/25.17.3389.

New chip may speed genome analysis.新型芯片可能加快基因组分析速度。

Science. 1989 May 12;244(4905):655-6. doi: 10.1126/science.2717944.

Basic local alignment search tool.基本局部比对搜索工具

J Mol Biol. 1990 Oct 5;215(3):403-10. doi: 10.1016/S0022-2836(05)80360-2.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验