基于图形处理单元的术中基于强度的图像配准的性能感知编程。

Performance-aware programming for intraoperative intensity-based image registration on graphics processing units.

机构信息

Department of Mechanical Engineering, The University of Hong Kong, Pok Fu Lam, Hong Kong.

Department of Computing, Imperial College London, London, SW7 2AZ, UK.

出版信息

Int J Comput Assist Radiol Surg. 2021 Mar;16(3):375-386. doi: 10.1007/s11548-020-02303-y. Epub 2021 Jan 23.

DOI:10.1007/s11548-020-02303-y

PMID:33484431

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7946684/

Abstract

PURPOSE

Intensity-based image registration has been proven essential in many applications accredited to its unparalleled ability to resolve image misalignments. However, long registration time for image realignment prohibits its use in intra-operative navigation systems. There has been much work on accelerating the registration process by improving the algorithm's robustness, but the innate computation required by the registration algorithm has been unresolved.

METHODS

Intensity-based registration methods involve operations with high arithmetic load and memory access demand, which supposes to be reduced by graphics processing units (GPUs). Although GPUs are widespread and affordable, there is a lack of open-source GPU implementations optimized for non-rigid image registration. This paper demonstrates performance-aware programming techniques, which involves systematic exploitation of GPU features, by implementing the diffeomorphic log-demons algorithm.

RESULTS

By resolving the pinpointed computation bottlenecks on GPU, our implementation of diffeomorphic log-demons on Nvidia GTX Titan X GPU has achieved ~ 95 times speed-up compared to the CPU and registered a 1.3-M voxel image in 286 ms. Even for large 37-M voxel images, our implementation is able to register in 8.56 s, which attained ~ 258 times speed-up. Our solution involves effective employment of GPU computation units, memory, and data bandwidth to resolve computation bottlenecks.

CONCLUSION

The computation bottlenecks in diffeomorphic log-demons are pinpointed, analyzed, and resolved using various GPU performance-aware programming techniques. The proposed fast computation on basic image operations not only enhances the computation of diffeomorphic log-demons, but is also potentially extended to speed up many other intensity-based approaches. Our implementation is open-source on GitHub at https://bit.ly/2PYZxQz .

摘要

目的

基于强度的图像配准已被证明在许多应用中是必不可少的，因为它具有无与伦比的解决图像配准的能力。然而，图像重新配准的注册时间较长，限制了其在术中导航系统中的应用。已经有很多工作致力于通过提高算法的鲁棒性来加速注册过程，但是注册算法固有的计算需求尚未得到解决。

方法

基于强度的配准方法涉及到具有高算术负载和内存访问需求的操作，这些操作可以通过图形处理单元（GPU）来减少。尽管 GPU 已经广泛应用且价格低廉，但缺乏针对非刚性图像配准的优化的开源 GPU 实现。本文通过实现变形对数恶魔算法，展示了性能感知编程技术，该技术涉及对 GPU 特性的系统利用。

结果

通过在 GPU 上解决了确定的计算瓶颈，我们在 Nvidia GTX Titan X GPU 上实现的变形对数恶魔算法的速度比 CPU 快了约 95 倍，并在 286 毫秒内注册了 1300 万体素的图像。即使对于 3700 万体素的大型图像，我们的实现也能够在 8.56 秒内完成注册，速度提高了约 258 倍。我们的解决方案涉及到有效利用 GPU 的计算单元、内存和数据带宽来解决计算瓶颈。

结论

通过各种 GPU 性能感知编程技术，确定了变形对数恶魔中的计算瓶颈，并对其进行了分析和解决。所提出的快速计算基本图像操作不仅增强了变形对数恶魔的计算能力，而且还可能扩展到加速许多其他基于强度的方法。我们的实现是开源的，可在 GitHub 上获得，网址为 https://bit.ly/2PYZxQz。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b72b/7946684/648b8415f4fc/11548_2020_2303_Fig1_HTML.jpg

相似文献

Performance-aware programming for intraoperative intensity-based image registration on graphics processing units.

Int J Comput Assist Radiol Surg. 2021 Mar;16(3):375-386. doi: 10.1007/s11548-020-02303-y. Epub 2021 Jan 23.

High performance computing for deformable image registration: towards a new paradigm in adaptive radiotherapy.

Med Phys. 2008 Aug;35(8):3546-53. doi: 10.1118/1.2948318.

GPU accelerated generation of digitally reconstructed radiographs for 2-D/3-D image registration.

IEEE Trans Biomed Eng. 2012 Sep;59(9):2594-603. doi: 10.1109/TBME.2012.2207898. Epub 2012 Jul 11.

A fast forward projection using multithreads for multirays on GPUs in medical image reconstruction.

Med Phys. 2011 Jul;38(7):4052-65. doi: 10.1118/1.3591994.

Ultra-fast digital tomosynthesis reconstruction using general-purpose GPU programming for image-guided radiation therapy.

Technol Cancer Res Treat. 2011 Aug;10(4):295-306. doi: 10.7785/tcrt.2012.500206.

Accelerating B-spline interpolation on GPUs: Application to medical image registration.

Comput Methods Programs Biomed. 2020 Sep;193:105431. doi: 10.1016/j.cmpb.2020.105431. Epub 2020 Mar 3.

Fast polyenergetic forward projection for image formation using OpenCL on a heterogeneous parallel computing platform.

Med Phys. 2012 Nov;39(11):6745-56. doi: 10.1118/1.4758062.

GPU-based streaming architectures for fast cone-beam CT image reconstruction and demons deformable registration.

Phys Med Biol. 2007 Oct 7;52(19):5771-83. doi: 10.1088/0031-9155/52/19/003. Epub 2007 Sep 10.

Implementation and evaluation of various demons deformable image registration algorithms on a GPU.

Phys Med Biol. 2010 Jan 7;55(1):207-19. doi: 10.1088/0031-9155/55/1/012.

Large-scale neural circuit mapping data analysis accelerated with the graphical processing unit (GPU).

J Neurosci Methods. 2015 Jan 15;239:1-10. doi: 10.1016/j.jneumeth.2014.09.022. Epub 2014 Sep 30.

本文引用的文献

Techniques for Stereotactic Neurosurgery: Beyond the Frame, Toward the Intraoperative Magnetic Resonance Imaging-Guided and Robot-Assisted Approaches.

World Neurosurg. 2018 Aug;116:77-87. doi: 10.1016/j.wneu.2018.04.155. Epub 2018 May 3.

Deep Adaptive Log-Demons: Diffeomorphic Image Registration with Very Large Deformations.

Comput Math Methods Med. 2015;2015:836202. doi: 10.1155/2015/836202. Epub 2015 May 18.

A new fast accurate nonlinear medical image registration program including surface preserving regularization.

IEEE Trans Med Imaging. 2014 Nov;33(11):2118-27. doi: 10.1109/TMI.2014.2332370. Epub 2014 Jun 23.

The Cancer Imaging Archive (TCIA): maintaining and operating a public information repository.

J Digit Imaging. 2013 Dec;26(6):1045-57. doi: 10.1007/s10278-013-9622-7.

Deformable medical image registration: a survey.

IEEE Trans Med Imaging. 2013 Jul;32(7):1153-90. doi: 10.1109/TMI.2013.2265603. Epub 2013 May 31.

Identification and acute targeting of gaps in atrial ablation lesion sets using a real-time magnetic resonance imaging system.

Circ Arrhythm Electrophysiol. 2012 Dec;5(6):1130-5. doi: 10.1161/CIRCEP.112.973164. Epub 2012 Oct 15.

3D Slicer as an image computing platform for the Quantitative Imaging Network.

Magn Reson Imaging. 2012 Nov;30(9):1323-41. doi: 10.1016/j.mri.2012.05.001. Epub 2012 Jul 6.

A Demons algorithm for image registration with locally adaptive regularization.

Med Image Comput Comput Assist Interv. 2009;12(Pt 1):574-81. doi: 10.1007/978-3-642-04268-3_71.

Review of intraoperative imaging and planning techniques in permanent seed prostate brachytherapy.

Radiother Oncol. 2010 Jan;94(1):12-23. doi: 10.1016/j.radonc.2009.12.012. Epub 2010 Jan 13.

Implementation and evaluation of various demons deformable image registration algorithms on a GPU.

Phys Med Biol. 2010 Jan 7;55(1):207-19. doi: 10.1088/0031-9155/55/1/012.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于图形处理单元的术中基于强度的图像配准的性能感知编程。

Performance-aware programming for intraoperative intensity-based image registration on graphics processing units.

机构信息

Department of Mechanical Engineering, The University of Hong Kong, Pok Fu Lam, Hong Kong.

Department of Computing, Imperial College London, London, SW7 2AZ, UK.