• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 FPGA 的 Smith-Waterman 算法加速前向和回溯步骤的提案。

Proposal of Smith-Waterman algorithm on FPGA to accelerate the forward and backtracking steps.

机构信息

Laboratory of Machine Learning and Intelligent Instrumentation, nPITI/IMD, Federal University of Rio Grande do Norte, Natal, Brazil.

Bioinformatics Multidisciplinary Environment (BioME), Federal University of Rio Grande do Norte, Natal 59078-970, RN, Brazil.

出版信息

PLoS One. 2022 Jun 30;17(6):e0254736. doi: 10.1371/journal.pone.0254736. eCollection 2022.

DOI:10.1371/journal.pone.0254736
PMID:35772072
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9246398/
Abstract

In bioinformatics, alignment is an essential technique for finding similarities between biological sequences. Usually, the alignment is performed with the Smith-Waterman (SW) algorithm, a well-known sequence alignment technique of high-level precision based on dynamic programming. However, given the massive data volume in biological databases and their continuous exponential increase, high-speed data processing is necessary. Therefore, this work proposes a parallel hardware design for the SW algorithm with a systolic array structure to accelerate the forward and backtracking steps. For this purpose, the architecture calculates and stores the paths in the forward stage for pre-organizing the alignment, which reduces the complexity of the backtracking stage. The backtracking starts from the maximum score position in the matrix and generates the optimal SW sequence alignment path. The architecture was validated on Field-Programmable Gate Array (FPGA), and synthesis analyses have shown that the proposed design reaches up to 79.5 Giga Cell Updates per Second (GCPUS).

摘要

在生物信息学中,比对是发现生物序列之间相似性的一项基本技术。通常,比对是通过 Smith-Waterman(SW)算法完成的,这是一种基于动态规划的高精度的著名序列比对技术。然而,考虑到生物数据库中的海量数据量及其持续的指数级增长,高速数据处理是必要的。因此,这项工作提出了一种基于脉动阵列结构的 SW 算法的并行硬件设计,以加速前向和回溯步骤。为此,该架构在向前阶段计算和存储路径,以便于对齐,从而降低回溯阶段的复杂度。回溯从矩阵中的最大得分位置开始,并生成最优的 SW 序列比对路径。该架构已在现场可编程门阵列(FPGA)上进行了验证,综合分析表明,所提出的设计达到了每秒 79.5 吉细胞更新(GCPUS)的速度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/5444364c8f79/pone.0254736.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/bfd6ccbe4605/pone.0254736.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/e64b7327f5f9/pone.0254736.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/285e311c7f0a/pone.0254736.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/c65d75a58917/pone.0254736.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/5f747b639041/pone.0254736.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/b0705ac6f36e/pone.0254736.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/c410467fd405/pone.0254736.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/762d16e897a8/pone.0254736.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/f0aaa66ceb34/pone.0254736.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/bd5231884652/pone.0254736.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/d55aab703127/pone.0254736.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/5444364c8f79/pone.0254736.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/bfd6ccbe4605/pone.0254736.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/e64b7327f5f9/pone.0254736.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/285e311c7f0a/pone.0254736.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/c65d75a58917/pone.0254736.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/5f747b639041/pone.0254736.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/b0705ac6f36e/pone.0254736.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/c410467fd405/pone.0254736.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/762d16e897a8/pone.0254736.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/f0aaa66ceb34/pone.0254736.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/bd5231884652/pone.0254736.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/d55aab703127/pone.0254736.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec96/9246398/5444364c8f79/pone.0254736.g012.jpg

相似文献

1
Proposal of Smith-Waterman algorithm on FPGA to accelerate the forward and backtracking steps.基于 FPGA 的 Smith-Waterman 算法加速前向和回溯步骤的提案。
PLoS One. 2022 Jun 30;17(6):e0254736. doi: 10.1371/journal.pone.0254736. eCollection 2022.
2
FPGASW: Accelerating Large-Scale Smith-Waterman Sequence Alignment Application with Backtracking on FPGA Linear Systolic Array.FPGA 软核:在 FPGA 线性脉动阵列上回溯实现大规模 Smith-Waterman 序列比对应用的加速。
Interdiscip Sci. 2018 Mar;10(1):176-188. doi: 10.1007/s12539-017-0225-8. Epub 2017 Apr 21.
3
160-fold acceleration of the Smith-Waterman algorithm using a field programmable gate array (FPGA).使用现场可编程门阵列(FPGA)将史密斯-沃特曼算法加速160倍。
BMC Bioinformatics. 2007 Jun 7;8:185. doi: 10.1186/1471-2105-8-185.
4
Pairwise alignment for very long nucleic acid sequences.非常长的核酸序列的两两比对。
Biochem Biophys Res Commun. 2018 Jul 20;502(3):313-317. doi: 10.1016/j.bbrc.2018.05.134. Epub 2018 May 29.
5
Parallel Tiled Codes Implementing the Smith-Waterman Alignment Algorithm for Two and Three Sequences.实现两个和三个序列的史密斯-沃特曼比对算法的并行平铺码
J Comput Biol. 2018 Oct;25(10):1106-1119. doi: 10.1089/cmb.2018.0006. Epub 2018 Jul 11.
6
CUDA compatible GPU cards as efficient hardware accelerators for Smith-Waterman sequence alignment.CUDA兼容的GPU卡作为用于Smith-Waterman序列比对的高效硬件加速器。
BMC Bioinformatics. 2008 Mar 26;9 Suppl 2(Suppl 2):S10. doi: 10.1186/1471-2105-9-S2-S10.
7
SWIFOLD: Smith-Waterman implementation on FPGA with OpenCL for long DNA sequences.SWIFOLD:基于OpenCL在FPGA上实现的用于长DNA序列的史密斯-沃特曼算法
BMC Syst Biol. 2018 Nov 20;12(Suppl 5):96. doi: 10.1186/s12918-018-0614-6.
8
Hardware accelerator for genomic sequence alignment.用于基因组序列比对的硬件加速器
Conf Proc IEEE Eng Med Biol Soc. 2006;2006:5787-9. doi: 10.1109/IEMBS.2006.260286.
9
Accelerating the Smith-Waterman algorithm with interpair pruning and band optimization for the all-pairs comparison of base sequences.通过配对间剪枝和带优化加速史密斯-沃特曼算法以进行碱基序列的全配对比较。
BMC Bioinformatics. 2015 Oct 6;16:321. doi: 10.1186/s12859-015-0744-4.
10
Heuristic reusable dynamic programming: efficient updates of local sequence alignment.启发式可重用动态规划:局部序列比对的高效更新。
IEEE/ACM Trans Comput Biol Bioinform. 2009 Oct-Dec;6(4):570-82. doi: 10.1109/TCBB.2009.30.

引用本文的文献

1
CUDASW++4.0: ultra-fast GPU-based Smith-Waterman protein sequence database search.CUDASW++4.0:基于 GPU 的超快 Smith-Waterman 蛋白质序列数据库搜索。
BMC Bioinformatics. 2024 Nov 2;25(1):342. doi: 10.1186/s12859-024-05965-6.

本文引用的文献

1
FPGA-Based Processor Acceleration for Image Processing Applications.用于图像处理应用的基于现场可编程门阵列的处理器加速
J Imaging. 2019 Jan 13;5(1):16. doi: 10.3390/jimaging5010016.
2
Fully Parallel Implementation of Otsu Automatic Image Thresholding Algorithm on FPGA.基于 FPGA 的 Otsu 自动图像阈值算法的完全并行实现。
Sensors (Basel). 2021 Jun 17;21(12):4151. doi: 10.3390/s21124151.
3
A full-parallel implementation of Self-Organizing Maps on hardware.硬件上的自组织映射的全并行实现。
Neural Netw. 2021 Nov;143:818-827. doi: 10.1016/j.neunet.2021.05.021. Epub 2021 May 21.
4
Identifying optimal COVID-19 testing strategies for schools and businesses: Balancing testing frequency, individual test technology, and cost.确定学校和企业的最佳 COVID-19 检测策略:平衡检测频率、个体检测技术和成本。
PLoS One. 2021 Mar 25;16(3):e0248783. doi: 10.1371/journal.pone.0248783. eCollection 2021.
5
16S rRNA gene sequencing of rectal swab in patients affected by COVID-19.对 COVID-19 患者的直肠拭子进行 16S rRNA 基因测序。
PLoS One. 2021 Feb 17;16(2):e0247041. doi: 10.1371/journal.pone.0247041. eCollection 2021.
6
SARS-CoV-2 spillover events.严重急性呼吸综合征冠状病毒2溢出事件。
Science. 2021 Jan 8;371(6525):120-122. doi: 10.1126/science.abf6097.
7
Full genome viral sequences inform patterns of SARS-CoV-2 spread into and within Israel.全基因组病毒序列揭示了 SARS-CoV-2 在以色列境内和之间传播的模式。
Nat Commun. 2020 Nov 2;11(1):5518. doi: 10.1038/s41467-020-19248-0.
8
Practical guide for managing large-scale human genome data in research.研究中管理大规模人类基因组数据的实用指南。
J Hum Genet. 2021 Jan;66(1):39-52. doi: 10.1038/s10038-020-00862-1. Epub 2020 Oct 23.
9
Accelerating next generation sequencing data analysis: an evaluation of optimized best practices for Genome Analysis Toolkit algorithms.加速下一代测序数据分析:对基因组分析工具包算法优化最佳实践的评估
Genomics Inform. 2020 Mar;18(1):e10. doi: 10.5808/GI.2020.18.1.e10. Epub 2020 Mar 31.
10
Bioinformatics and Computational Tools for Next-Generation Sequencing Analysis in Clinical Genetics.临床遗传学中用于下一代测序分析的生物信息学和计算工具
J Clin Med. 2020 Jan 3;9(1):132. doi: 10.3390/jcm9010132.