• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种在图形处理单元上使用 warp 混洗操作的基于莱文斯坦距离的并行近似字符串匹配。

A parallel approximate string matching under Levenshtein distance on graphics processing units using warp-shuffle operations.

作者信息

Ho ThienLuan, Oh Seung-Rohk, Kim HyunJin

机构信息

School of Electronics and Electrical Engineering, Dankook University, Yongin-si, Republic of Korea.

出版信息

PLoS One. 2017 Oct 10;12(10):e0186251. doi: 10.1371/journal.pone.0186251. eCollection 2017.

DOI:10.1371/journal.pone.0186251
PMID:29016700
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5634649/
Abstract

Approximate string matching with k-differences has a number of practical applications, ranging from pattern recognition to computational biology. This paper proposes an efficient memory-access algorithm for parallel approximate string matching with k-differences on Graphics Processing Units (GPUs). In the proposed algorithm, all threads in the same GPUs warp share data using warp-shuffle operation instead of accessing the shared memory. Moreover, we implement the proposed algorithm by exploiting the memory structure of GPUs to optimize its performance. Experiment results for real DNA packages revealed that the performance of the proposed algorithm and its implementation archived up to 122.64 and 1.53 times compared to that of sequential algorithm on CPU and previous parallel approximate string matching algorithm on GPUs, respectively.

摘要

具有k差异的近似字符串匹配有许多实际应用,从模式识别到计算生物学。本文提出了一种高效的内存访问算法,用于在图形处理单元(GPU)上进行具有k差异的并行近似字符串匹配。在所提出的算法中,同一GPU warp中的所有线程使用warp-shuffle操作共享数据,而不是访问共享内存。此外,我们通过利用GPU的内存结构来实现所提出的算法,以优化其性能。对真实DNA包的实验结果表明,所提出的算法及其实现的性能分别比CPU上的顺序算法和GPU上以前的并行近似字符串匹配算法提高了122.64倍和1.53倍。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e29/5634649/ae2e57179610/pone.0186251.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e29/5634649/44d8e42f1f31/pone.0186251.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e29/5634649/50e6dbb430e6/pone.0186251.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e29/5634649/b880984e3d1f/pone.0186251.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e29/5634649/30566de262b7/pone.0186251.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e29/5634649/9f0fe9e18afe/pone.0186251.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e29/5634649/f5983f3fe875/pone.0186251.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e29/5634649/ae2e57179610/pone.0186251.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e29/5634649/44d8e42f1f31/pone.0186251.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e29/5634649/50e6dbb430e6/pone.0186251.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e29/5634649/b880984e3d1f/pone.0186251.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e29/5634649/30566de262b7/pone.0186251.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e29/5634649/9f0fe9e18afe/pone.0186251.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e29/5634649/f5983f3fe875/pone.0186251.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2e29/5634649/ae2e57179610/pone.0186251.g007.jpg

相似文献

1
A parallel approximate string matching under Levenshtein distance on graphics processing units using warp-shuffle operations.一种在图形处理单元上使用 warp 混洗操作的基于莱文斯坦距离的并行近似字符串匹配。
PLoS One. 2017 Oct 10;12(10):e0186251. doi: 10.1371/journal.pone.0186251. eCollection 2017.
2
Parallel Implementation of MAFFT on CUDA-Enabled Graphics Hardware.MAFFT在支持CUDA的图形硬件上的并行实现。
IEEE/ACM Trans Comput Biol Bioinform. 2015 Jan-Feb;12(1):205-18. doi: 10.1109/TCBB.2014.2351801.
3
libFLASM: a software library for fixed-length approximate string matching.libFLASM:一个用于固定长度近似字符串匹配的软件库。
BMC Bioinformatics. 2016 Nov 10;17(1):454. doi: 10.1186/s12859-016-1320-2.
4
A fast forward projection using multithreads for multirays on GPUs in medical image reconstruction.基于 GPU 的医学图像重建中多线程快速前向投影的多射线算法。
Med Phys. 2011 Jul;38(7):4052-65. doi: 10.1118/1.3591994.
5
GPUDePiCt: A Parallel Implementation of a Clustering Algorithm for Computing Degenerate Primers on Graphics Processing Units.GPUDePiCt:一种用于在图形处理单元上计算简并引物的聚类算法的并行实现。
IEEE/ACM Trans Comput Biol Bioinform. 2015 Mar-Apr;12(2):445-54. doi: 10.1109/TCBB.2014.2355231.
6
A Hybrid CPU/GPU Pattern-Matching Algorithm for Deep Packet Inspection.一种用于深度包检测的混合CPU/GPU模式匹配算法。
PLoS One. 2015 Oct 5;10(10):e0139301. doi: 10.1371/journal.pone.0139301. eCollection 2015.
7
Exploiting graphics processing units for computational biology and bioinformatics.利用图形处理单元进行计算生物学和生物信息学。
Interdiscip Sci. 2010 Sep;2(3):213-20. doi: 10.1007/s12539-010-0002-4. Epub 2010 Jul 25.
8
Parallel Methods for Finding k-Mismatch Shortest Unique Substrings Using GPU.
IEEE/ACM Trans Comput Biol Bioinform. 2021 Jan-Feb;18(1):386-395. doi: 10.1109/TCBB.2019.2935061. Epub 2021 Feb 4.
9
Using GPUs for the exact alignment of short-read genetic sequences by means of the Burrows-Wheeler transform.利用图形处理单元(GPU)通过 Burrows-Wheeler 变换实现短读基因序列的精确比对。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Jul-Aug;9(4):1245-56. doi: 10.1109/TCBB.2012.49.
10
Improved algorithms for approximate string matching (extended abstract).用于近似字符串匹配的改进算法(扩展摘要)。
BMC Bioinformatics. 2009 Jan 30;10 Suppl 1(Suppl 1):S10. doi: 10.1186/1471-2105-10-S1-S10.

引用本文的文献

1
A k-mismatch string matching for generalized edit distance using diagonal skipping method.基于对角线跳跃法的广义编辑距离的 k 不匹配字符串匹配。
PLoS One. 2021 May 4;16(5):e0251047. doi: 10.1371/journal.pone.0251047. eCollection 2021.
2
DLI-IT: a deep learning approach to drug label identification through image and text embedding.DLI-IT:一种通过图像和文本嵌入进行药物标签识别的深度学习方法。
BMC Med Inform Decis Mak. 2020 Apr 15;20(1):68. doi: 10.1186/s12911-020-1078-3.

本文引用的文献

1
CMSA: a heterogeneous CPU/GPU computing system for multiple similar RNA/DNA sequence alignment.CMSA:一种用于多个相似RNA/DNA序列比对的异构CPU/GPU计算系统。
BMC Bioinformatics. 2017 Jun 24;18(1):315. doi: 10.1186/s12859-017-1725-6.
2
A Pipelined Non-Deterministic Finite Automaton-Based String Matching Scheme Using Merged State Transitions in an FPGA.一种基于流水线非确定性有限自动机的字符串匹配方案,该方案在现场可编程门阵列中使用合并状态转换
PLoS One. 2016 Oct 3;11(10):e0163535. doi: 10.1371/journal.pone.0163535. eCollection 2016.
3
HAlign: Fast multiple similar DNA/RNA sequence alignment based on the centre star strategy.
HAlign:基于中心星型策略的快速多重相似DNA/RNA序列比对
Bioinformatics. 2015 Aug 1;31(15):2475-81. doi: 10.1093/bioinformatics/btv177. Epub 2015 Mar 25.
4
Survey of MapReduce frame operation in bioinformatics.生物信息学中MapReduce框架操作的调查。
Brief Bioinform. 2014 Jul;15(4):637-47. doi: 10.1093/bib/bbs088. Epub 2013 Feb 7.
5
Analysing humanly generated random number sequences: a pattern-based approach.分析人为生成的随机数序列:基于模式的方法。
PLoS One. 2012;7(7):e41531. doi: 10.1371/journal.pone.0041531. Epub 2012 Jul 23.
6
Application of approximate pattern matching in two dimensional spaces to grid layout for biochemical network maps.二维空间中近似模式匹配在生化网络图网格布局中的应用。
PLoS One. 2012;7(6):e37739. doi: 10.1371/journal.pone.0037739. Epub 2012 Jun 5.