Families of FPGA-Based Accelerators for Approximate String Matching

Authors

Tom Van Court, Martin C. Herbordt

Affiliation

Department of Electrical and Computer Engineering, Boston University.

Publication

Microprocess Microsyst. 2007 Mar 5;31(2):135-145. doi: 10.1016/j.micpro.2006.04.001.

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3096528/

Abstract

Dynamic programming for approximate string matching is a large family of algorithms that vary significantly in purpose, complexity, and hardware utilization. Many implementations have reported impressive speed-ups, but they have typically been point solutions: highly specialized and addressing only one or a few of the many possible options. The problem to be solved is creating a hardware description that implements a broad range of behavioral options without losing efficiency to feature bloat. We report a set of three component types that address different parts of the approximate string matching problem. This allows each application to choose the feature set it requires and then make maximum use of the FPGA fabric according to that application's specific resource requirements. Multiple, interchangeable implementations are available for each component type. We show that these methods allow the efficient generation of a large, if not complete, family of accelerators for this application. This flexibility was obtained while retaining high performance: we evaluated a sample against serial reference codes and found speed-ups of 150× to 400× over a high-end PC.
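To make the idea of a "family" of dynamic-programming matchers concrete, the following is a minimal serial sketch in Python, in the spirit of the serial reference codes the abstract mentions. It is not the authors' hardware description or their reference code. The function names (`match_mismatch`, `align_score`, `edit_distance`), the linear gap penalty, and the split into a pluggable character-scoring rule plus a global/local switch are all assumptions chosen for illustration; the abstract does not say what the paper's three component types are.

```python
from typing import Callable

# A character-scoring rule: maps a pair of characters to an integer score.
Score = Callable[[str, str], int]


def match_mismatch(match: int = 1, mismatch: int = -1) -> Score:
    """Build a simple scoring rule (hypothetical helper, not from the paper)."""
    return lambda a, b: match if a == b else mismatch


def align_score(query: str, target: str, score: Score,
                gap: int = -1, local: bool = False) -> int:
    """Serial dynamic-programming alignment score with a linear gap penalty.

    local=False gives a global alignment score (Needleman-Wunsch style);
    local=True gives the best local alignment score (Smith-Waterman style).
    """
    # prev holds row i-1 of the DP matrix; only two rows are kept in memory.
    prev = [0 if local else j * gap for j in range(len(target) + 1)]
    best = 0
    for i, q in enumerate(query, start=1):
        curr = [0 if local else i * gap]
        for j, t in enumerate(target, start=1):
            cell = max(prev[j - 1] + score(q, t),  # align q with t
                       prev[j] + gap,              # gap in target
                       curr[j - 1] + gap)          # gap in query
            if local:
                cell = max(cell, 0)                # local scores never go negative
                best = max(best, cell)
            curr.append(cell)
        prev = curr
    return best if local else prev[-1]


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance expressed as one member of the same family."""
    return -align_score(a, b, match_mismatch(0, -1), gap=-1)


if __name__ == "__main__":
    print(edit_distance("kitten", "sitting"))
    print(align_score("ACGT", "TTACGTTT", match_mismatch(2, -1), gap=-2, local=True))
```

Swapping the scoring rule or the `local` flag changes the behavior without touching the inner loop, which is a software analogue of choosing among interchangeable components. A hardware design would typically evaluate many cells in parallel rather than one cell at a time as here.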

Similar articles

1

Families of FPGA-Based Accelerators for Approximate String Matching.

Microprocess Microsyst. 2007 Mar 5;31(2):135-145. doi: 10.1016/j.micpro.2006.04.001.

2

Hardware-Algorithm Codesign for Fast and Energy Efficient Approximate String Matching on FPGA for Computational Biology.

Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:87-90. doi: 10.1109/EMBC48229.2022.9870924.

3

A survey of field programmable gate array (FPGA)-based graph convolutional neural network accelerators: challenges and opportunities.

PeerJ Comput Sci. 2022 Nov 28;8:e1166. doi: 10.7717/peerj-cs.1166. eCollection 2022.

4

Single Pass Streaming BLAST on FPGAs.

Parallel Comput. 2007 Nov;33(10-11):741-756. doi: 10.1016/j.parco.2007.09.003.

5

libFLASM: a software library for fixed-length approximate string matching.

BMC Bioinformatics. 2016 Nov 10;17(1):454. doi: 10.1186/s12859-016-1320-2.

6

Programming and Runtime Support to FPGA Accelerator Deployment at Datacenter Scale.

Proc ACM Symp Cloud Comput. 2016 Oct;2016:456-469. doi: 10.1145/2987550.2987569.

7

Hardware-Software Codesign Based Accelerated and Reconfigurable Methodology for String Matching in Computational Bioinformatics Applications.

IEEE/ACM Trans Comput Biol Bioinform. 2020 Jul-Aug;17(4):1198-1210. doi: 10.1109/TCBB.2018.2885296. Epub 2018 Dec 10.

8

A hybrid short read mapping accelerator.

BMC Bioinformatics. 2013 Feb 26;14:67. doi: 10.1186/1471-2105-14-67.

9

Implementation of a motion estimation algorithm for Intel FPGAs using OpenCL.

J Supercomput. 2023;79(9):9866-9888. doi: 10.1007/s11227-023-05051-3. Epub 2023 Jan 21.

10

An OpenCL-Based FPGA Accelerator for Faster R-CNN.

Entropy (Basel). 2022 Sep 23;24(10):1346. doi: 10.3390/e24101346.

Cited by

1

Proposal of Smith-Waterman algorithm on FPGA to accelerate the forward and backtracking steps.

PLoS One. 2022 Jun 30;17(6):e0254736. doi: 10.1371/journal.pone.0254736. eCollection 2022.

2

Single Pass Streaming BLAST on FPGAs.

Parallel Comput. 2007 Nov;33(10-11):741-756. doi: 10.1016/j.parco.2007.09.003.

References

1

A general method applicable to the search for similarities in the amino acid sequence of two proteins.

J Mol Biol. 1970 Mar;48(3):443-53. doi: 10.1016/0022-2836(70)90057-4.

2

New chip may speed genome analysis.

Science. 1989 May 12;244(4905):655-6. doi: 10.1126/science.2717944.
