• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用PaSWAS在通用并行图形处理单元上进行灵活、快速且准确的序列比对分析。

Flexible, fast and accurate sequence alignment profiling on GPGPU with PaSWAS.

作者信息

Warris Sven, Yalcin Feyruz, Jackson Katherine J L, Nap Jan Peter

机构信息

Institute for Life Science & Technology & Hanze Research Center Energy, Hanze University of Applied Sciences Groningen, 9747 AS, Zernikeplein 11, Groningen, The Netherlands.

KeyGene N. V., 6708 PW, Agro Business Park 90, Wageningen, The Netherlands.

出版信息

PLoS One. 2015 Apr 1;10(4):e0122524. doi: 10.1371/journal.pone.0122524. eCollection 2015.

DOI:10.1371/journal.pone.0122524
PMID:25830241
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4382095/
Abstract

MOTIVATION

To obtain large-scale sequence alignments in a fast and flexible way is an important step in the analyses of next generation sequencing data. Applications based on the Smith-Waterman (SW) algorithm are often either not fast enough, limited to dedicated tasks or not sufficiently accurate due to statistical issues. Current SW implementations that run on graphics hardware do not report the alignment details necessary for further analysis.

RESULTS

With the Parallel SW Alignment Software (PaSWAS) it is possible (a) to have easy access to the computational power of NVIDIA-based general purpose graphics processing units (GPGPUs) to perform high-speed sequence alignments, and (b) retrieve relevant information such as score, number of gaps and mismatches. The software reports multiple hits per alignment. The added value of the new SW implementation is demonstrated with two test cases: (1) tag recovery in next generation sequence data and (2) isotype assignment within an immunoglobulin 454 sequence data set. Both cases show the usability and versatility of the new parallel Smith-Waterman implementation.

摘要

动机

以快速且灵活的方式获得大规模序列比对是下一代测序数据分析中的重要一步。基于史密斯-沃特曼(SW)算法的应用程序通常速度不够快,仅限于特定任务,或者由于统计问题而不够准确。当前在图形硬件上运行的SW实现并未报告进一步分析所需的比对细节。

结果

使用并行SW比对软件(PaSWAS),可以(a)轻松利用基于NVIDIA的通用图形处理单元(GPGPU)的计算能力来执行高速序列比对,以及(b)检索诸如得分、缺口数量和错配数量等相关信息。该软件会报告每次比对的多个命中结果。通过两个测试用例展示了新的SW实现的附加价值:(1)下一代序列数据中的标签恢复,以及(2)免疫球蛋白454序列数据集中的同种型分配。这两个案例均展示了新的并行史密斯-沃特曼实现的可用性和通用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ee/4382095/146b51d478f5/pone.0122524.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ee/4382095/4a029675f62e/pone.0122524.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ee/4382095/cb38aed32f8d/pone.0122524.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ee/4382095/2c927f30555d/pone.0122524.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ee/4382095/ffe86e90f412/pone.0122524.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ee/4382095/146b51d478f5/pone.0122524.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ee/4382095/4a029675f62e/pone.0122524.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ee/4382095/cb38aed32f8d/pone.0122524.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ee/4382095/2c927f30555d/pone.0122524.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ee/4382095/ffe86e90f412/pone.0122524.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ee/4382095/146b51d478f5/pone.0122524.g005.jpg

相似文献

1
Flexible, fast and accurate sequence alignment profiling on GPGPU with PaSWAS.使用PaSWAS在通用并行图形处理单元上进行灵活、快速且准确的序列比对分析。
PLoS One. 2015 Apr 1;10(4):e0122524. doi: 10.1371/journal.pone.0122524. eCollection 2015.
2
pyPaSWAS: Python-based multi-core CPU and GPU sequence alignment.pyPaSWAS:基于Python的多核CPU和GPU序列比对工具。
PLoS One. 2018 Jan 2;13(1):e0190279. doi: 10.1371/journal.pone.0190279. eCollection 2018.
3
CUDA compatible GPU cards as efficient hardware accelerators for Smith-Waterman sequence alignment.CUDA兼容的GPU卡作为用于Smith-Waterman序列比对的高效硬件加速器。
BMC Bioinformatics. 2008 Mar 26;9 Suppl 2(Suppl 2):S10. doi: 10.1186/1471-2105-9-S2-S10.
4
SWIFOLD: Smith-Waterman implementation on FPGA with OpenCL for long DNA sequences.SWIFOLD:基于OpenCL在FPGA上实现的用于长DNA序列的史密斯-沃特曼算法
BMC Syst Biol. 2018 Nov 20;12(Suppl 5):96. doi: 10.1186/s12918-018-0614-6.
5
FPGASW: Accelerating Large-Scale Smith-Waterman Sequence Alignment Application with Backtracking on FPGA Linear Systolic Array.FPGA 软核:在 FPGA 线性脉动阵列上回溯实现大规模 Smith-Waterman 序列比对应用的加速。
Interdiscip Sci. 2018 Mar;10(1):176-188. doi: 10.1007/s12539-017-0225-8. Epub 2017 Apr 21.
6
A novel partial sequence alignment tool for finding large deletions.一种用于查找大片段缺失的新型局部序列比对工具。
ScientificWorldJournal. 2012;2012:694813. doi: 10.1100/2012/694813. Epub 2012 Apr 1.
7
Parallel Tiled Codes Implementing the Smith-Waterman Alignment Algorithm for Two and Three Sequences.实现两个和三个序列的史密斯-沃特曼比对算法的并行平铺码
J Comput Biol. 2018 Oct;25(10):1106-1119. doi: 10.1089/cmb.2018.0006. Epub 2018 Jul 11.
8
BarraCUDA - a fast short read sequence aligner using graphics processing units.BarraCUDA——一种使用图形处理单元的快速短读序列比对工具。
BMC Res Notes. 2012 Jan 13;5:27. doi: 10.1186/1756-0500-5-27.
9
Exact and complete short-read alignment to microbial genomes using Graphics Processing Unit programming.使用图形处理单元编程实现微生物基因组的精确和完整短读序列比对。
Bioinformatics. 2011 May 15;27(10):1351-8. doi: 10.1093/bioinformatics/btr151. Epub 2011 Mar 30.
10
SSW library: an SIMD Smith-Waterman C/C++ library for use in genomic applications.SSW 库:一个用于基因组应用的 SIMD Smith-Waterman C/C++ 库。
PLoS One. 2013 Dec 4;8(12):e82138. doi: 10.1371/journal.pone.0082138. eCollection 2013.

引用本文的文献

1
Identifying T cell antigen at the atomic level with graph convolutional network.利用图卷积网络在原子水平上识别T细胞抗原。
Nat Commun. 2025 Jun 4;16(1):5171. doi: 10.1038/s41467-025-60461-6.
2
Correcting palindromes in long reads after whole-genome amplification.长读段全基因组扩增后回文序列的校正。
BMC Genomics. 2018 Nov 6;19(1):798. doi: 10.1186/s12864-018-5164-1.
3
GPU-DAEMON: GPU algorithm design, data management & optimization template for array based big omics data.GPU-DAEMON:基于数组的大型组学数据的 GPU 算法设计、数据管理和优化模板。

本文引用的文献

1
Trimmomatic: a flexible trimmer for Illumina sequence data.Trimmomatic:一款适用于 Illumina 测序数据的灵活修剪工具。
Bioinformatics. 2014 Aug 1;30(15):2114-20. doi: 10.1093/bioinformatics/btu170. Epub 2014 Apr 1.
2
SW#-GPU-enabled exact alignments on genome scale.SW#-GPU 加速的全基因组精确比对。
Bioinformatics. 2013 Oct 1;29(19):2494-5. doi: 10.1093/bioinformatics/btt410. Epub 2013 Jul 17.
3
Kraken: a set of tools for quality control and analysis of high-throughput sequence data.Kraken:一组用于高通量测序数据质量控制和分析的工具。
Comput Biol Med. 2018 Oct 1;101:163-173. doi: 10.1016/j.compbiomed.2018.08.015. Epub 2018 Aug 16.
4
pyPaSWAS: Python-based multi-core CPU and GPU sequence alignment.pyPaSWAS:基于Python的多核CPU和GPU序列比对工具。
PLoS One. 2018 Jan 2;13(1):e0190279. doi: 10.1371/journal.pone.0190279. eCollection 2018.
5
Living apart together: crosstalk between the core and supernumerary genomes in a fungal plant pathogen.分开同居:一种真菌植物病原体中核心基因组与额外基因组之间的相互作用
BMC Genomics. 2016 Aug 23;17(1):670. doi: 10.1186/s12864-016-2941-6.
Methods. 2013 Sep 1;63(1):41-9. doi: 10.1016/j.ymeth.2013.06.027. Epub 2013 Jun 29.
4
Topical perspective on massive threading and parallelism.关于大规模线程化和并行性的专题视角。
J Mol Graph Model. 2011 Sep;30:82-9. doi: 10.1016/j.jmgm.2011.06.007. Epub 2011 Jun 29.
5
GPU-BLAST: using graphics processors to accelerate protein sequence alignment.GPU-BLAST:利用图形处理器加速蛋白质序列比对。
Bioinformatics. 2011 Jan 15;27(2):182-8. doi: 10.1093/bioinformatics/btq644. Epub 2010 Nov 18.
6
De novo assembly of human genomes with massively parallel short read sequencing.利用大规模平行短读测序进行人类基因组从头组装。
Genome Res. 2010 Feb;20(2):265-72. doi: 10.1101/gr.097261.109. Epub 2009 Dec 17.
7
Sequencing technologies - the next generation.测序技术——下一代。
Nat Rev Genet. 2010 Jan;11(1):31-46. doi: 10.1038/nrg2626. Epub 2009 Dec 8.
8
MicroRazerS: rapid alignment of small RNA reads.MicroRazerS:快速对齐小 RNA reads。
Bioinformatics. 2010 Jan 1;26(1):123-4. doi: 10.1093/bioinformatics/btp601. Epub 2009 Oct 29.
9
Next-generation sequencing methods: impact of sequencing accuracy on SNP discovery.下一代测序方法:测序准确性对单核苷酸多态性发现的影响。
Methods Mol Biol. 2009;578:95-111. doi: 10.1007/978-1-60327-411-1_5.
10
SNP detection for massively parallel whole-genome resequencing.用于大规模平行全基因组重测序的单核苷酸多态性检测
Genome Res. 2009 Jun;19(6):1124-32. doi: 10.1101/gr.088013.108. Epub 2009 May 6.