基于CUBLAS和CULA的用于生物发光断层成像的自适应有限元框架的GPU加速

The CUBLAS and CULA based GPU acceleration of adaptive finite element framework for bioluminescence tomography.

作者信息

Zhang Bo, Yang Xiang, Yang Fei, Yang Xin, Qin Chenghu, Han Dong, Ma Xibo, Liu Kai, Tian Jie

机构信息

Sino-Dutch Biomedical and Information Engineering School of Northeastern University, Shenyang, China.

出版信息

Opt Express. 2010 Sep 13;18(19):20201-14. doi: 10.1364/OE.18.020201.

DOI:10.1364/OE.18.020201

PMID:20940911

Abstract

In molecular imaging (MI), especially the optical molecular imaging, bioluminescence tomography (BLT) emerges as an effective imaging modality for small animal imaging. The finite element methods (FEMs), especially the adaptive finite element (AFE) framework, play an important role in BLT. The processing speed of the FEMs and the AFE framework still needs to be improved, although the multi-thread CPU technology and the multi CPU technology have already been applied. In this paper, we for the first time introduce a new kind of acceleration technology to accelerate the AFE framework for BLT, using the graphics processing unit (GPU). Besides the processing speed, the GPU technology can get a balance between the cost and performance. The CUBLAS and CULA are two main important and powerful libraries for programming on NVIDIA GPUs. With the help of CUBLAS and CULA, it is easy to code on NVIDIA GPU and there is no need to worry about the details about the hardware environment of a specific GPU. The numerical experiments are designed to show the necessity, effect and application of the proposed CUBLAS and CULA based GPU acceleration. From the results of the experiments, we can reach the conclusion that the proposed CUBLAS and CULA based GPU acceleration method can improve the processing speed of the AFE framework very much while getting a balance between cost and performance.

摘要

在分子成像（MI）中，尤其是光学分子成像领域，生物发光断层扫描（BLT）成为一种用于小动物成像的有效成像方式。有限元方法（FEM），特别是自适应有限元（AFE）框架，在BLT中发挥着重要作用。尽管多线程CPU技术和多CPU技术已经得到应用，但FEM和AFE框架的处理速度仍有待提高。在本文中，我们首次引入一种新型加速技术，即使用图形处理单元（GPU）来加速用于BLT的AFE框架。除了处理速度外，GPU技术还能在成本和性能之间取得平衡。CUBLAS和CULA是用于在NVIDIA GPU上进行编程的两个主要且强大的库。借助CUBLAS和CULA，在NVIDIA GPU上进行编码很容易，而且无需担心特定GPU硬件环境的细节。设计数值实验以展示所提出的基于CUBLAS和CULA的GPU加速的必要性、效果及应用。从实验结果可以得出结论，所提出的基于CUBLAS和CULA的GPU加速方法能够在很大程度上提高AFE框架的处理速度，同时在成本和性能之间取得平衡。

相似文献

The CUBLAS and CULA based GPU acceleration of adaptive finite element framework for bioluminescence tomography.基于CUBLAS和CULA的用于生物发光断层成像的自适应有限元框架的GPU加速

Opt Express. 2010 Sep 13;18(19):20201-14. doi: 10.1364/OE.18.020201.

A trust region method in adaptive finite element framework for bioluminescence tomography.用于生物发光断层成像的自适应有限元框架中的信赖域方法。

Opt Express. 2010 Mar 29;18(7):6477-91. doi: 10.1364/OE.18.006477.

Spectrally resolved bioluminescence tomography with adaptive finite element analysis: methodology and simulation.基于自适应有限元分析的光谱分辨生物发光断层成像：方法与模拟

Phys Med Biol. 2007 Aug 7;52(15):4497-512. doi: 10.1088/0031-9155/52/15/009. Epub 2007 Jul 3.

Performance and scalability of Fourier domain optical coherence tomography acceleration using graphics processing units.使用图形处理单元的傅里叶域光学相干断层扫描加速的性能与可扩展性

Appl Opt. 2011 May 1;50(13):1832-8. doi: 10.1364/AO.50.001832.

Equalizer: a scalable parallel rendering framework.均衡器：一个可扩展的并行渲染框架。

IEEE Trans Vis Comput Graph. 2009 May-Jun;15(3):436-52. doi: 10.1109/TVCG.2008.104.

Performance evaluation of image processing algorithms on the GPU.图像处理算法在图形处理器上的性能评估。

J Struct Biol. 2008 Oct;164(1):153-60. doi: 10.1016/j.jsb.2008.07.006. Epub 2008 Jul 24.

A fast bioluminescent source localization method based on generalized graph cuts with mouse model validations.一种基于广义图割并经小鼠模型验证的快速生物发光源定位方法。

Opt Express. 2010 Feb 15;18(4):3732-45. doi: 10.1364/OE.18.003732.

Implementation and performance evaluation of reconstruction algorithms on graphics processors.图形处理器上重建算法的实现与性能评估

J Struct Biol. 2007 Jan;157(1):288-95. doi: 10.1016/j.jsb.2006.08.010. Epub 2006 Sep 1.

High-speed nonlinear finite element analysis for surgical simulation using graphics processing units.使用图形处理单元进行手术模拟的高速非线性有限元分析

IEEE Trans Med Imaging. 2008 May;27(5):650-63. doi: 10.1109/TMI.2007.913112.

GPU-accelerated FDTD modeling of radio-frequency field-tissue interactions in high-field MRI.GPU 加速的高频 MRI 中射频场-组织相互作用的 FDTD 建模。

IEEE Trans Biomed Eng. 2011 Jun;58(6):1789-96. doi: 10.1109/TBME.2011.2116020. Epub 2011 Feb 17.

引用本文的文献

GPU-based block-wise nonlocal means denoising for 3D ultrasound images.基于 GPU 的块式非局部均值去噪方法用于三维超声图像。

Comput Math Methods Med. 2013;2013:921303. doi: 10.1155/2013/921303. Epub 2013 Nov 3.

Toward real-time availability of 3D temperature maps created with temporally constrained reconstruction.迈向通过时间约束重建实现3D温度图的实时可用性。

Magn Reson Med. 2014 Apr;71(4):1394-404. doi: 10.1002/mrm.24783. Epub 2013 May 13.

Acceleration of early-photon fluorescence molecular tomography with graphics processing units.基于图形处理单元的早期荧光分子断层成像加速。

Comput Math Methods Med. 2013;2013:297291. doi: 10.1155/2013/297291. Epub 2013 Mar 31.

High-performance image reconstruction in fluorescence tomography on desktop computers and graphics hardware.基于台式计算机和图形硬件的荧光断层成像中的高性能图像重建

Biomed Opt Express. 2011 Nov 1;2(11):3207-22. doi: 10.1364/BOE.2.003207. Epub 2011 Oct 28.

GPU-Accelerated Finite Element Method for Modelling Light Transport in Diffuse Optical Tomography.用于漫射光学层析成像中光传输建模的GPU加速有限元方法

Int J Biomed Imaging. 2011;2011:403892. doi: 10.1155/2011/403892. Epub 2011 Oct 16.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于CUBLAS和CULA的用于生物发光断层成像的自适应有限元框架的GPU加速

The CUBLAS and CULA based GPU acceleration of adaptive finite element framework for bioluminescence tomography.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献