Feng Yilin, Gudukbay Akbulut Gulsum, Tang Xulong, Gunasekaran Jashwant Raj, Rahman Amatur, Medvedev Paul, Kandemir Mahmut
Department of Computer Science and Engineering, The Pennsylvania State University, University Park, PA 16802, USA.
Department of Computer Science, University of Pittsburgh, Pittsburgh, PA 15260, USA.
Bioinform Adv. 2022 Nov 30;2(1):vbac088. doi: 10.1093/bioadv/vbac088. eCollection 2022.
The third-generation DNA sequencing technologies, such as Nanopore Sequencing, can operate at very high speeds and produce longer reads, which in turn results in a challenge for the computational analysis of such massive data. is a software package for signal-level analysis of Oxford Nanopore sequencing data. Call-methylation module of can detect methylation based on Hidden Markov Model (HMM). However, is limited by the long running time of some serial and computationally expensive processes. Among these, Adaptive Banded Event Alignment (ABEA) is the most time-consuming step, and the prior work, , has already parallelized and optimized ABEA on GPU. As a result, the remaining methylation score calculation part, which uses HMM to identify if a given base is methylated or not, has become the new performance bottleneck.
This article focuses on the call-methylation module that resides in the package. We propose , which parallelizes and optimizes the methylation score calculation step on GPU and then pipelines the four steps of the call-methylation module. increases the execution concurrency across CPUs and GPUs as well as hardware resource utilization for both. The experimental results collected indicate that can achieve 3×-5× speedup compared with , and reduce the total execution time by 35% compared with , on average.
The source code of is available at https://github.com/fengyilin118/.
第三代DNA测序技术,如纳米孔测序,能够以非常高的速度运行并产生更长的读段,这反过来给如此海量数据的计算分析带来了挑战。[具体软件名称]是一个用于牛津纳米孔测序数据信号级分析的软件包。[具体软件名称]的调用甲基化模块可以基于隐马尔可夫模型(HMM)检测甲基化。然而,[具体软件名称]受到一些串行且计算成本高昂的过程运行时间长的限制。其中,自适应带状事件比对(ABEA)是最耗时的步骤,之前的工作[相关工作名称]已经在图形处理器(GPU)上对ABEA进行了并行化和优化。因此,使用HMM来确定给定碱基是否甲基化的剩余甲基化分数计算部分,成为了新的性能瓶颈。
本文聚焦于[具体软件名称]包中的调用甲基化模块。我们提出了[具体方法名称],它在GPU上对甲基化分数计算步骤进行并行化和优化,然后将调用甲基化模块的四个步骤进行流水线处理。[具体方法名称]提高了跨中央处理器(CPU)和GPU的执行并发度以及两者的硬件资源利用率。收集到的实验结果表明,与[对比对象1]相比,[具体方法名称]可以实现3倍至5倍的加速,并且与[对比对象2]相比,平均将总执行时间减少了35%。
[具体方法名称]的源代码可在https://github.com/fengyilin118/获取。