Cheng Haoyu, Zhang Yong, Xu Yun
IEEE/ACM Trans Comput Biol Bioinform. 2018 Apr 3. doi: 10.1109/TCBB.2018.2822687.
The explosive growth of next-generation sequencing (NGS) read datasets drives a need for new faster read mappers. One class of read mappers, called all-mappers, is designed to identify all mapping locations of each read. Many all-mappers have been developed over the past few years, but they are either time-consuming or memory-consuming. Here, we present BitMapper2, a GPU-accelerated read mapper that reports all mapping locations of NGS reads. To make full use of the parallel processing capability of GPUs, BitMapper2 proposes the sparse q-gram index, which reduces the memory requirement and the data transfer time between GPU and CPU. We also design the filtration part and the verification part of BitMapper2 specifically for the architecture of GPU. In addition, BitMapper2 is still time-efficient and memory-efficient even if there is no GPU available. Experiments show that BitMapper2 was significantly faster than the state-of-the-art all-mappers, while requiring less space.
下一代测序(NGS)读取数据集的爆炸式增长推动了对新型更快读取映射器的需求。一类读取映射器,称为全映射器,旨在识别每个读取的所有映射位置。在过去几年中已经开发了许多全映射器,但它们要么耗时,要么耗内存。在这里,我们展示了BitMapper2,这是一种GPU加速的读取映射器,它报告NGS读取的所有映射位置。为了充分利用GPU的并行处理能力,BitMapper2提出了稀疏q-gram索引,这减少了内存需求以及GPU和CPU之间的数据传输时间。我们还专门针对GPU架构设计了BitMapper2的过滤部分和验证部分。此外,即使没有可用的GPU,BitMapper2仍然具有时间效率和内存效率。实验表明,BitMapper2比最先进的全映射器快得多,同时所需空间更少。