图像处理算法在图形处理器上的性能评估。

Performance evaluation of image processing algorithms on the GPU.

作者信息

Castaño-Díez Daniel, Moser Dominik, Schoenegger Andreas, Pruggnaller Sabine, Frangakis Achilleas S

机构信息

Computational and Structural Biology, European Molecular Biology Laboratory, Meyerhofstr. 1, 69117 Heidelberg, Germany.

出版信息

J Struct Biol. 2008 Oct;164(1):153-60. doi: 10.1016/j.jsb.2008.07.006. Epub 2008 Jul 24.

DOI:10.1016/j.jsb.2008.07.006

PMID:18692140

Abstract

The graphics processing unit (GPU), which originally was used exclusively for visualization purposes, has evolved into an extremely powerful co-processor. In the meanwhile, through the development of elaborate interfaces, the GPU can be used to process data and deal with computationally intensive applications. The speed-up factors attained compared to the central processing unit (CPU) are dependent on the particular application, as the GPU architecture gives the best performance for algorithms that exhibit high data parallelism and high arithmetic intensity. Here, we evaluate the performance of the GPU on a number of common algorithms used for three-dimensional image processing. The algorithms were developed on a new software platform called "CUDA", which allows a direct translation from C code to the GPU. The implemented algorithms include spatial transformations, real-space and Fourier operations, as well as pattern recognition procedures, reconstruction algorithms and classification procedures. In our implementation, the direct porting of C code in the GPU achieves typical acceleration values in the order of 10-20 times compared to a state-of-the-art conventional processor, but they vary depending on the type of the algorithm. The gained speed-up comes with no additional costs, since the software runs on the GPU of the graphics card of common workstations.

摘要

图形处理单元（GPU）最初仅用于可视化目的，如今已发展成为功能极其强大的协处理器。与此同时，通过精心设计的接口开发，GPU可用于处理数据并应对计算密集型应用。与中央处理器（CPU）相比所实现的加速因子取决于具体应用，因为GPU架构对于展现出高数据并行度和高算术强度的算法能提供最佳性能。在此，我们评估GPU在一些用于三维图像处理的常见算法上的性能。这些算法是在一个名为“CUDA”的新软件平台上开发的，该平台允许将C代码直接转换到GPU上。所实现的算法包括空间变换、实空间和傅里叶运算，以及模式识别程序、重建算法和分类程序。在我们的实现中，与最先进的传统处理器相比，在GPU中直接移植C代码可实现典型的10到20倍的加速值，但它们会因算法类型而异。所获得的加速无需额外成本，因为该软件可在普通工作站显卡的GPU上运行。

相似文献

Performance evaluation of image processing algorithms on the GPU.图像处理算法在图形处理器上的性能评估。

J Struct Biol. 2008 Oct;164(1):153-60. doi: 10.1016/j.jsb.2008.07.006. Epub 2008 Jul 24.

Implementation and performance evaluation of reconstruction algorithms on graphics processors.图形处理器上重建算法的实现与性能评估

J Struct Biol. 2007 Jan;157(1):288-95. doi: 10.1016/j.jsb.2006.08.010. Epub 2006 Sep 1.

Performance and scalability of Fourier domain optical coherence tomography acceleration using graphics processing units.使用图形处理单元的傅里叶域光学相干断层扫描加速的性能与可扩展性

Appl Opt. 2011 May 1;50(13):1832-8. doi: 10.1364/AO.50.001832.

GPU-based streaming architectures for fast cone-beam CT image reconstruction and demons deformable registration.基于图形处理器（GPU）的流架构用于快速锥束计算机断层扫描（CT）图像重建和戴蒙斯可变形配准。

Phys Med Biol. 2007 Oct 7;52(19):5771-83. doi: 10.1088/0031-9155/52/19/003. Epub 2007 Sep 10.

Graphics processing unit accelerated computation of digital holograms.图形处理单元加速数字全息图的计算。

Appl Opt. 2009 Dec 1;48(34):H137-43. doi: 10.1364/AO.48.00H137.

GPU based real-time quadrature transform method for 3-D surface measurement and visualization.基于GPU的三维表面测量与可视化实时正交变换方法

Opt Express. 2011 Jun 20;19(13):12125-30. doi: 10.1364/OE.19.012125.

Mapping high-fidelity volume rendering for medical imaging to CPU, GPU and many-core architectures.医学成像的高保真体绘制映射到 CPU、GPU 和多核架构。

IEEE Trans Vis Comput Graph. 2009 Nov-Dec;15(6):1563-70. doi: 10.1109/TVCG.2009.164.

A matrix approach to tomographic reconstruction and its implementation on GPUs.一种层析重建的矩阵方法及其在 GPU 上的实现。

J Struct Biol. 2010 Apr;170(1):146-51. doi: 10.1016/j.jsb.2010.01.021. Epub 2010 Feb 2.

High performance computing for deformable image registration: towards a new paradigm in adaptive radiotherapy.用于可变形图像配准的高性能计算：迈向自适应放射治疗的新范式。

Med Phys. 2008 Aug;35(8):3546-53. doi: 10.1118/1.2948318.

A streaming narrow-band algorithm: interactive computation and visualization of level sets.一种流窄带算法：水平集的交互式计算与可视化

IEEE Trans Vis Comput Graph. 2004 Jul-Aug;10(4):422-33. doi: 10.1109/TVCG.2004.2.

引用本文的文献

The big chill: Growth of structural biology with cryo-electron tomography.大变革：冷冻电子断层扫描技术助力结构生物学发展

QRB Discov. 2024 Dec 13;5:e10. doi: 10.1017/qrd.2024.10. eCollection 2024.

Topographic design in wearable MXene sensors with in-sensor machine learning for full-body avatar reconstruction.可穿戴 MXene 传感器中的地形设计与传感器内机器学习相结合，用于全身化身重建。

Nat Commun. 2022 Sep 9;13(1):5311. doi: 10.1038/s41467-022-33021-5.

Ultrasound-based liver tracking utilizing a hybrid template/optical flow approach.基于超声的肝脏跟踪，利用混合模板/光流方法。

Int J Comput Assist Radiol Surg. 2018 Oct;13(10):1605-1615. doi: 10.1007/s11548-018-1780-0. Epub 2018 Jun 5.

Accelerated cryo-EM structure determination with parallelisation using GPUs in RELION-2.在RELION-2中使用图形处理器（GPU）并行化加速冷冻电镜结构测定

Elife. 2016 Nov 15;5:e18722. doi: 10.7554/eLife.18722.

A survey of GPU-based medical image computing techniques.基于 GPU 的医学图像处理技术综述。

Quant Imaging Med Surg. 2012 Sep;2(3):188-206. doi: 10.3978/j.issn.2223-4292.2012.08.02.

Evaluation of a multicore-optimized implementation for tomographic reconstruction.评估一种多核优化的层析重建实现。

PLoS One. 2012;7(11):e48261. doi: 10.1371/journal.pone.0048261. Epub 2012 Nov 6.

Automatic alignment and reconstruction of images for soft X-ray tomography.软 X 射线断层摄影术的图像自动配准和重建。

J Struct Biol. 2012 Feb;177(2):259-66. doi: 10.1016/j.jsb.2011.11.027. Epub 2011 Dec 2.

A distributed multi-GPU system for high speed electron microscopic tomographic reconstruction.一种用于高速电子显微镜断层重建的分布式多 GPU 系统。

Ultramicroscopy. 2011 Jul;111(8):1137-43. doi: 10.1016/j.ultramic.2011.03.015. Epub 2011 Apr 1.

GPU-enabled FREALIGN: accelerating single particle 3D reconstruction and refinement in Fourier space on graphics processors.GPU 加速的 FREALIGN：在图形处理器上的傅里叶空间中加速单颗粒 3D 重构和精修。

J Struct Biol. 2010 Dec;172(3):407-12. doi: 10.1016/j.jsb.2010.06.010. Epub 2010 Jun 15.

An adaptive Expectation-Maximization algorithm with GPU implementation for electron cryomicroscopy.基于 GPU 实现的电子冷冻显微镜的自适应期望最大化算法。

J Struct Biol. 2010 Sep;171(3):256-65. doi: 10.1016/j.jsb.2010.06.004. Epub 2010 Jun 9.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

图像处理算法在图形处理器上的性能评估。

Performance evaluation of image processing algorithms on the GPU.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献