• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SWIFOLD:基于OpenCL在FPGA上实现的用于长DNA序列的史密斯-沃特曼算法

SWIFOLD: Smith-Waterman implementation on FPGA with OpenCL for long DNA sequences.

作者信息

Rucci Enzo, Garcia Carlos, Botella Guillermo, De Giusti Armando, Naiouf Marcelo, Prieto-Matias Manuel

机构信息

III-LIDI, CONICET, Facultad de Informática, Universidad Nacional de La Plata, La Plata (Buenos Aires), 1900, Argentina.

Depto. Arquitectura de Computadores y Automática, Universidad Complutense de Madrid, Madrid, 28040, Spain.

出版信息

BMC Syst Biol. 2018 Nov 20;12(Suppl 5):96. doi: 10.1186/s12918-018-0614-6.

DOI:10.1186/s12918-018-0614-6
PMID:30458766
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6245597/
Abstract

BACKGROUND

The Smith-Waterman (SW) algorithm is the best choice for searching similar regions between two DNA or protein sequences. However, it may become impracticable in some contexts due to its high computational demands. Consequently, the computer science community has focused on the use of modern parallel architectures such as Graphics Processing Units (GPUs), Xeon Phi accelerators and Field Programmable Gate Arrays (FGPAs) to speed up large-scale workloads.

RESULTS

This paper presents and evaluates SWIFOLD: a Smith-Waterman parallel Implementation on FPGA with OpenCL for Long DNA sequences. First, we evaluate its performance and resource usage for different kernel configurations. Next, we carry out a performance comparison between our tool and other state-of-the-art implementations considering three different datasets. SWIFOLD offers the best average performance for small and medium test sets, achieving a performance that is independent of input size and sequence similarity. In addition, SWIFOLD provides competitive performance rates in comparison with GPU-based implementations on the latest GPU generation for the large dataset.

CONCLUSIONS

The results suggest that SWIFOLD can be a serious contender for accelerating the SW alignment of DNA sequences of unrestricted size in an affordable way reaching on average 125 GCUPS and almost a peak of 270 GCUPS.

摘要

背景

史密斯-沃特曼(SW)算法是搜索两条DNA或蛋白质序列之间相似区域的最佳选择。然而,由于其高计算需求,在某些情况下可能不切实际。因此,计算机科学界专注于使用现代并行架构,如图形处理单元(GPU)、至强融核加速器和现场可编程门阵列(FPGA)来加速大规模工作负载。

结果

本文介绍并评估了SWIFOLD:一种基于FPGA、使用OpenCL针对长DNA序列的史密斯-沃特曼并行实现。首先,我们评估了其在不同内核配置下的性能和资源使用情况。接下来,我们在考虑三个不同数据集的情况下,将我们的工具与其他最先进的实现进行了性能比较。对于中小型测试集,SWIFOLD提供了最佳的平均性能,实现了与输入大小和序列相似性无关的性能。此外,与基于最新一代GPU针对大型数据集的实现相比,SWIFOLD也提供了具有竞争力的性能率。

结论

结果表明,SWIFOLD可以成为以经济实惠的方式加速无限制大小DNA序列的SW比对的有力竞争者,平均可达125 GCUPS,峰值接近270 GCUPS。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/437a/6245597/a00d01044ce6/12918_2018_614_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/437a/6245597/2eb12c5b064b/12918_2018_614_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/437a/6245597/a00d01044ce6/12918_2018_614_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/437a/6245597/2eb12c5b064b/12918_2018_614_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/437a/6245597/a00d01044ce6/12918_2018_614_Fig2_HTML.jpg

相似文献

1
SWIFOLD: Smith-Waterman implementation on FPGA with OpenCL for long DNA sequences.SWIFOLD:基于OpenCL在FPGA上实现的用于长DNA序列的史密斯-沃特曼算法
BMC Syst Biol. 2018 Nov 20;12(Suppl 5):96. doi: 10.1186/s12918-018-0614-6.
2
ADEPT: a domain independent sequence alignment strategy for gpu architectures.ADEPT:一种适用于 GPU 架构的与领域无关的序列比对策略。
BMC Bioinformatics. 2020 Sep 15;21(1):406. doi: 10.1186/s12859-020-03720-1.
3
160-fold acceleration of the Smith-Waterman algorithm using a field programmable gate array (FPGA).使用现场可编程门阵列(FPGA)将史密斯-沃特曼算法加速160倍。
BMC Bioinformatics. 2007 Jun 7;8:185. doi: 10.1186/1471-2105-8-185.
4
Parallel algorithms for large-scale biological sequence alignment on Xeon-Phi based clusters.基于至强融核集群的大规模生物序列比对并行算法
BMC Bioinformatics. 2016 Jul 19;17 Suppl 9(Suppl 9):267. doi: 10.1186/s12859-016-1128-0.
5
FPGASW: Accelerating Large-Scale Smith-Waterman Sequence Alignment Application with Backtracking on FPGA Linear Systolic Array.FPGA 软核:在 FPGA 线性脉动阵列上回溯实现大规模 Smith-Waterman 序列比对应用的加速。
Interdiscip Sci. 2018 Mar;10(1):176-188. doi: 10.1007/s12539-017-0225-8. Epub 2017 Apr 21.
6
Accelerating the Smith-Waterman algorithm with interpair pruning and band optimization for the all-pairs comparison of base sequences.通过配对间剪枝和带优化加速史密斯-沃特曼算法以进行碱基序列的全配对比较。
BMC Bioinformatics. 2015 Oct 6;16:321. doi: 10.1186/s12859-015-0744-4.
7
CUDASW++: optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units.CUDASW++:针对支持CUDA的图形处理单元优化史密斯-沃特曼序列数据库搜索
BMC Res Notes. 2009 May 6;2:73. doi: 10.1186/1756-0500-2-73.
8
CUDA compatible GPU cards as efficient hardware accelerators for Smith-Waterman sequence alignment.CUDA兼容的GPU卡作为用于Smith-Waterman序列比对的高效硬件加速器。
BMC Bioinformatics. 2008 Mar 26;9 Suppl 2(Suppl 2):S10. doi: 10.1186/1471-2105-9-S2-S10.
9
Improving the Mapping of Smith-Waterman Sequence Database Searches onto CUDA-Enabled GPUs.改进史密斯-沃特曼序列数据库搜索在支持CUDA的图形处理器上的映射。
Biomed Res Int. 2015;2015:185179. doi: 10.1155/2015/185179. Epub 2015 Aug 3.
10
Pairwise sequence alignment for very long sequences on GPUs.在图形处理器(GPU)上对超长序列进行成对序列比对。
Int J Bioinform Res Appl. 2014;10(4-5):345-68. doi: 10.1504/IJBRA.2014.062989.

引用本文的文献

1
CUDASW++4.0: ultra-fast GPU-based Smith-Waterman protein sequence database search.CUDASW++4.0:基于 GPU 的超快 Smith-Waterman 蛋白质序列数据库搜索。
BMC Bioinformatics. 2024 Nov 2;25(1):342. doi: 10.1186/s12859-024-05965-6.
2
An FPGA-based hardware accelerator supporting sensitive sequence homology filtering with profile hidden Markov models.基于 FPGA 的硬件加速器,支持使用隐马尔可夫模型进行敏感序列同源性过滤。
BMC Bioinformatics. 2024 Jul 29;25(1):247. doi: 10.1186/s12859-024-05879-3.
3
Proposal of Smith-Waterman algorithm on FPGA to accelerate the forward and backtracking steps.

本文引用的文献

1
Parasail: SIMD C library for global, semi-global, and local pairwise sequence alignments.Parasail:用于全局、半全局和局部成对序列比对的SIMD C库。
BMC Bioinformatics. 2016 Feb 10;17:81. doi: 10.1186/s12859-016-0930-z.
2
SSW library: an SIMD Smith-Waterman C/C++ library for use in genomic applications.SSW 库:一个用于基因组应用的 SIMD Smith-Waterman C/C++ 库。
PLoS One. 2013 Dec 4;8(12):e82138. doi: 10.1371/journal.pone.0082138. eCollection 2013.
3
SW#-GPU-enabled exact alignments on genome scale.SW#-GPU 加速的全基因组精确比对。
基于 FPGA 的 Smith-Waterman 算法加速前向和回溯步骤的提案。
PLoS One. 2022 Jun 30;17(6):e0254736. doi: 10.1371/journal.pone.0254736. eCollection 2022.
4
A Review of Parallel Implementations for the Smith-Waterman Algorithm.《Smith-Waterman 算法的并行实现综述》。
Interdiscip Sci. 2022 Mar;14(1):1-14. doi: 10.1007/s12539-021-00473-0. Epub 2021 Sep 6.
5
GPU acceleration of Darwin read overlapper for de novo assembly of long DNA reads.GPU 加速 Darwin 读重叠算法用于长 DNA reads 的从头组装。
BMC Bioinformatics. 2020 Sep 17;21(Suppl 13):388. doi: 10.1186/s12859-020-03685-1.
Bioinformatics. 2013 Oct 1;29(19):2494-5. doi: 10.1093/bioinformatics/btt410. Epub 2013 Jul 17.
4
Biology: The big challenges of big data.生物学:大数据的巨大挑战。
Nature. 2013 Jun 13;498(7453):255-60. doi: 10.1038/498255a.
5
CUDASW++ 3.0: accelerating Smith-Waterman protein database search by coupling CPU and GPU SIMD instructions.CUDASW++ 3.0:通过结合 CPU 和 GPU 的 SIMD 指令来加速 Smith-Waterman 蛋白质数据库搜索。
BMC Bioinformatics. 2013 Apr 4;14:117. doi: 10.1186/1471-2105-14-117.
6
PARALIGN: rapid and sensitive sequence similarity searches powered by parallel computing technology.PARALIGN:由并行计算技术驱动的快速且灵敏的序列相似性搜索。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W535-9. doi: 10.1093/nar/gki423.
7
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.空位BLAST和位置特异性迭代BLAST:新一代蛋白质数据库搜索程序。
Nucleic Acids Res. 1997 Sep 1;25(17):3389-402. doi: 10.1093/nar/25.17.3389.
8
Identification of common molecular subsequences.常见分子子序列的鉴定
J Mol Biol. 1981 Mar 25;147(1):195-7. doi: 10.1016/0022-2836(81)90087-5.
9
An improved algorithm for matching biological sequences.一种用于匹配生物序列的改进算法。
J Mol Biol. 1982 Dec 15;162(3):705-8. doi: 10.1016/0022-2836(82)90398-9.
10
Improved tools for biological sequence comparison.用于生物序列比较的改进工具。
Proc Natl Acad Sci U S A. 1988 Apr;85(8):2444-8. doi: 10.1073/pnas.85.8.2444.