Arioc: High-concurrency short-read alignment on multiple GPUs.

Affiliations

Department of Physics and Astronomy, Johns Hopkins University, Baltimore, Maryland, United States of America.

Department of Computer Science, Johns Hopkins University, Baltimore, Maryland, United States of America.

Publication information

PLoS Comput Biol. 2020 Nov 9;16(11):e1008383. doi: 10.1371/journal.pcbi.1008383. eCollection 2020 Nov.

Abstract

In large DNA sequence repositories, archival data storage is often coupled with computers that provide 40 or more CPU threads and multiple GPU (general-purpose graphics processing unit) devices. This presents an opportunity for DNA sequence alignment software to exploit high-concurrency hardware to generate short-read alignments at high speed. Arioc, a GPU-accelerated short-read aligner, can compute WGS (whole-genome sequencing) alignments ten times faster than comparable CPU-only alignment software. When two or more GPUs are available, Arioc's speed increases proportionately because the software executes concurrently on each available GPU device. We have adapted Arioc to recent multi-GPU hardware architectures that support high-bandwidth peer-to-peer memory accesses among multiple GPUs. By modifying Arioc's implementation to exploit this GPU memory architecture, we obtained a further 1.8x-2.9x increase in overall alignment speeds. With this additional acceleration, Arioc computes two million short-read alignments per second in a four-GPU system; it can align the reads from a human WGS sequencer run (over 500 million 150nt paired-end reads) in less than 15 minutes. As WGS data accumulates exponentially and high-concurrency computational resources become widespread, Arioc addresses a growing need for timely computation in the short-read data analysis toolchain.
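The throughput figures in the abstract can be checked with simple arithmetic. The sketch below is only a sanity check, not part of Arioc: the function name `seconds_to_align` is hypothetical, and it assumes a sustained rate of two million alignments per second with no allowance for file I/O or index loading. Because the abstract does not say whether "500 million paired-end reads" counts pairs or individual mates, both readings are computed, using 500 million as the stated lower bound.

```python
def seconds_to_align(num_reads: int, alignments_per_second: float) -> float:
    """Return the seconds needed to align num_reads at a steady rate."""
    if alignments_per_second <= 0:
        raise ValueError("alignments_per_second must be positive")
    return num_reads / alignments_per_second


# Figures taken from the abstract.
READS = 500_000_000        # "over 500 million" reads, used as a lower bound
RATE = 2_000_000           # alignments per second on a four-GPU system
BUDGET_SECONDS = 15 * 60   # "less than 15 minutes"

# Assumption 1: each counted read costs one alignment.
as_counted = seconds_to_align(READS, RATE)

# Assumption 2: the count refers to pairs, so there are twice as many mates.
per_mate = seconds_to_align(2 * READS, RATE)

print(f"one alignment per counted read: {as_counted / 60:.1f} min")
print(f"two mates per counted read:     {per_mate / 60:.1f} min")
print(f"within budget: {as_counted < BUDGET_SECONDS and per_mate < BUDGET_SECONDS}")
```

Under either reading the estimate stays below the 15-minute figure, which is consistent with the abstract's claim for a run somewhat larger than 500 million reads.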

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32da/7676696/52cddf117377/pcbi.1008383.g001.jpg
