• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

《Smith-Waterman 算法的并行实现综述》。

A Review of Parallel Implementations for the Smith-Waterman Algorithm.

机构信息

School of Computer, National University of Defense Technology, Changsha, 410073, China.

出版信息

Interdiscip Sci. 2022 Mar;14(1):1-14. doi: 10.1007/s12539-021-00473-0. Epub 2021 Sep 6.

DOI:10.1007/s12539-021-00473-0
PMID:34487327
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8419822/
Abstract

The rapid advances in sequencing technology have led to an explosion of sequence data. Sequence alignment is the central and fundamental problem in many sequence analysis procedure, while local alignment is often the kernel of these algorithms. Usually, Smith-Waterman algorithm is used to find the best subsequence match between given sequences. However, the high time complexity makes the algorithm time-consuming. A lot of approaches have been developed to accelerate and parallelize it, such as vector-level parallelization, thread-level parallelization, process-level parallelization, and heterogeneous acceleration, but the current researches seem unsystematic, which hinders the further research of parallelizing the algorithm. In this paper, we summarize the current research status of parallel local alignments and describe the data layout in these work. Based on the research status, we emphasize large-scale genomic comparisons. By surveying some typical alignment tools' performance, we discuss some possible directions in the future. We hope our work will provide the developers of the alignment tool with technical principle support, and help researchers choose proper alignment tools.

摘要

测序技术的快速发展导致了序列数据的爆炸式增长。序列比对是许多序列分析过程中的核心和基本问题,而局部比对通常是这些算法的核心。通常,Smith-Waterman 算法用于在给定序列之间找到最佳子序列匹配。然而,高时间复杂度使得算法非常耗时。已经开发了许多方法来加速和并行化它,例如矢量级并行化、线程级并行化、进程级并行化和异构加速,但目前的研究似乎缺乏系统性,这阻碍了算法的进一步研究。在本文中,我们总结了并行局部比对的当前研究现状,并描述了这些工作中的数据布局。基于研究现状,我们强调了大规模基因组比较。通过调查一些典型比对工具的性能,我们讨论了未来的一些可能方向。我们希望我们的工作将为比对工具的开发人员提供技术原理支持,并帮助研究人员选择合适的比对工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/10c6866e1120/12539_2021_473_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/e10fa2bf3b85/12539_2021_473_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/de1601cb5129/12539_2021_473_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/75fd3732fe69/12539_2021_473_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/f0d4b3d6ca5e/12539_2021_473_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/7e265dbd6018/12539_2021_473_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/9b671b8c6a05/12539_2021_473_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/f6cb8acc77f9/12539_2021_473_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/3eef10c41bea/12539_2021_473_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/c3a82f408e9c/12539_2021_473_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/10c6866e1120/12539_2021_473_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/e10fa2bf3b85/12539_2021_473_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/de1601cb5129/12539_2021_473_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/75fd3732fe69/12539_2021_473_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/f0d4b3d6ca5e/12539_2021_473_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/7e265dbd6018/12539_2021_473_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/9b671b8c6a05/12539_2021_473_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/f6cb8acc77f9/12539_2021_473_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/3eef10c41bea/12539_2021_473_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/c3a82f408e9c/12539_2021_473_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9297/8419822/10c6866e1120/12539_2021_473_Fig10_HTML.jpg

相似文献

1
A Review of Parallel Implementations for the Smith-Waterman Algorithm.《Smith-Waterman 算法的并行实现综述》。
Interdiscip Sci. 2022 Mar;14(1):1-14. doi: 10.1007/s12539-021-00473-0. Epub 2021 Sep 6.
2
Striped Smith-Waterman speeds database searches six times over other SIMD implementations.条纹史密斯-沃特曼算法在数据库搜索速度上比其他单指令多数据(SIMD)实现快六倍。
Bioinformatics. 2007 Jan 15;23(2):156-61. doi: 10.1093/bioinformatics/btl582. Epub 2006 Nov 16.
3
A parallel pairwise local sequence alignment algorithm.一种并行成对局部序列比对算法。
IEEE Trans Nanobioscience. 2009 Jun;8(2):139-46. doi: 10.1109/TNB.2009.2019642. Epub 2009 Apr 10.
4
SSW library: an SIMD Smith-Waterman C/C++ library for use in genomic applications.SSW 库:一个用于基因组应用的 SIMD Smith-Waterman C/C++ 库。
PLoS One. 2013 Dec 4;8(12):e82138. doi: 10.1371/journal.pone.0082138. eCollection 2013.
5
FPGASW: Accelerating Large-Scale Smith-Waterman Sequence Alignment Application with Backtracking on FPGA Linear Systolic Array.FPGA 软核:在 FPGA 线性脉动阵列上回溯实现大规模 Smith-Waterman 序列比对应用的加速。
Interdiscip Sci. 2018 Mar;10(1):176-188. doi: 10.1007/s12539-017-0225-8. Epub 2017 Apr 21.
6
Optimizing amino acid substitution matrices with a local alignment kernel.使用局部比对核优化氨基酸替换矩阵。
BMC Bioinformatics. 2006 May 5;7:246. doi: 10.1186/1471-2105-7-246.
7
Accelerating the Smith-Waterman algorithm with interpair pruning and band optimization for the all-pairs comparison of base sequences.通过配对间剪枝和带优化加速史密斯-沃特曼算法以进行碱基序列的全配对比较。
BMC Bioinformatics. 2015 Oct 6;16:321. doi: 10.1186/s12859-015-0744-4.
8
CUDA compatible GPU cards as efficient hardware accelerators for Smith-Waterman sequence alignment.CUDA兼容的GPU卡作为用于Smith-Waterman序列比对的高效硬件加速器。
BMC Bioinformatics. 2008 Mar 26;9 Suppl 2(Suppl 2):S10. doi: 10.1186/1471-2105-9-S2-S10.
9
Parallel Tiled Codes Implementing the Smith-Waterman Alignment Algorithm for Two and Three Sequences.实现两个和三个序列的史密斯-沃特曼比对算法的并行平铺码
J Comput Biol. 2018 Oct;25(10):1106-1119. doi: 10.1089/cmb.2018.0006. Epub 2018 Jul 11.
10
Pairwise alignment for very long nucleic acid sequences.非常长的核酸序列的两两比对。
Biochem Biophys Res Commun. 2018 Jul 20;502(3):313-317. doi: 10.1016/j.bbrc.2018.05.134. Epub 2018 May 29.

引用本文的文献

1
A survey of sequence-to-graph mapping algorithms in the pangenome era.泛基因组时代序列到图谱映射算法综述。
Genome Biol. 2025 May 22;26(1):138. doi: 10.1186/s13059-025-03606-6.
2
Fast noisy long read alignment with multi-level parallelism.基于多级并行的快速噪声长读比对
BMC Bioinformatics. 2025 May 2;26(1):118. doi: 10.1186/s12859-025-06129-w.
3
SWIGH-SCORE: A translational light-weight approach in computational detection of rearranged immunoglobulin heavy chain to be used in monoclonal lymphoproliferative disorders.

本文引用的文献

1
Parallel computing for genome sequence processing.基因组序列处理的并行计算。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab070.
2
ADEPT: a domain independent sequence alignment strategy for gpu architectures.ADEPT:一种适用于 GPU 架构的与领域无关的序列比对策略。
BMC Bioinformatics. 2020 Sep 15;21(1):406. doi: 10.1186/s12859-020-03720-1.
3
Analyzing COVID-19 virus based on enhanced fragmented biological Local Aligner using improved Ions Motion Optimization algorithm.基于改进的离子运动优化算法的增强型片段化生物局部比对器分析新冠病毒。
SWIGH评分:一种用于单克隆淋巴细胞增殖性疾病的计算检测重排免疫球蛋白重链的转化轻量级方法。
MethodsX. 2024 May 10;12:102741. doi: 10.1016/j.mex.2024.102741. eCollection 2024 Jun.
Appl Soft Comput. 2020 Nov;96:106683. doi: 10.1016/j.asoc.2020.106683. Epub 2020 Sep 3.
4
Evolution of biosequence search algorithms: a brief survey.生物序列搜索算法的发展历程:简要综述。
Bioinformatics. 2019 Oct 1;35(19):3547-3552. doi: 10.1093/bioinformatics/btz272.
5
SWIFOLD: Smith-Waterman implementation on FPGA with OpenCL for long DNA sequences.SWIFOLD:基于OpenCL在FPGA上实现的用于长DNA序列的史密斯-沃特曼算法
BMC Syst Biol. 2018 Nov 20;12(Suppl 5):96. doi: 10.1186/s12918-018-0614-6.
6
Minimap2: pairwise alignment for nucleotide sequences.Minimap2:核苷酸序列的两两比对。
Bioinformatics. 2018 Sep 15;34(18):3094-3100. doi: 10.1093/bioinformatics/bty191.
7
Generic accelerated sequence alignment in SeqAn using vectorization and multi-threading.使用矢量化和多线程在 SeqAn 中进行通用加速序列比对。
Bioinformatics. 2018 Oct 15;34(20):3437-3445. doi: 10.1093/bioinformatics/bty380.
8
Introducing difference recurrence relations for faster semi-global alignment of long sequences.引入差异递归关系以加快长序列的半全局比对。
BMC Bioinformatics. 2018 Feb 19;19(Suppl 1):45. doi: 10.1186/s12859-018-2014-8.
9
FPGASW: Accelerating Large-Scale Smith-Waterman Sequence Alignment Application with Backtracking on FPGA Linear Systolic Array.FPGA 软核:在 FPGA 线性脉动阵列上回溯实现大规模 Smith-Waterman 序列比对应用的加速。
Interdiscip Sci. 2018 Mar;10(1):176-188. doi: 10.1007/s12539-017-0225-8. Epub 2017 Apr 21.
10
Parasail: SIMD C library for global, semi-global, and local pairwise sequence alignments.Parasail:用于全局、半全局和局部成对序列比对的SIMD C库。
BMC Bioinformatics. 2016 Feb 10;17:81. doi: 10.1186/s12859-016-0930-z.