Lancaster Joseph, Buhler Jeremy, Chamberlain Roger D
Department of Computer Science and Engineering, Washington University in St. Louis, E-mail:
Microprocess Microsyst. 2009 Jun 1;33(4):281-289. doi: 10.1016/j.micpro.2009.02.007.
The amount of biosequence data being produced each year is growing exponentially. Extracting useful information from this massive amount of data efficiently is becoming an increasingly difficult task. There are many available software tools that molecular biologists use for comparing genomic data. This paper focuses on accelerating the most widely used such tool, BLAST. Mercury BLAST takes a streaming approach to the BLAST computation by off loading the performance-critical sections to specialized hardware. This hardware is then used in combination with the processor of the host system to deliver BLAST results in a fraction of the time of the general-purpose processor alone.This paper presents the design of the ungapped extension stage of Mercury BLAST. The architecture of the ungapped extension stage is described along with the context of this stage within the Mercury BLAST system. The design is compact and runs at 100 MHz on available FPGAs, making it an effective and powerful component for accelerating biosequence comparisons. The performance of this stage is 25× that of the standard software distribution, yielding close to 50× performance improvement on the complete BLAST application. The sensitivity is essentially equivalent to that of the standard distribution.
每年产生的生物序列数据量正呈指数级增长。从如此海量的数据中高效提取有用信息正变得越来越困难。分子生物学家有许多可用于比较基因组数据的软件工具。本文着重于加速此类最广泛使用的工具——BLAST。Mercury BLAST通过将性能关键部分卸载到专用硬件上,采用流式方法进行BLAST计算。然后,该硬件与主机系统的处理器结合使用,从而能在仅使用通用处理器所需时间的一小部分内得出BLAST结果。本文介绍了Mercury BLAST的无间隙延伸阶段的设计。描述了无间隙延伸阶段的架构以及该阶段在Mercury BLAST系统中的背景情况。该设计紧凑,在现有的现场可编程门阵列(FPGA)上以100兆赫兹运行,使其成为加速生物序列比较的有效且强大的组件。该阶段的性能是标准软件版本的25倍,在完整的BLAST应用程序上实现了近50倍的性能提升。其灵敏度与标准版本基本相当。