Zhang Kang, Kang Jin U
Department of Electrical and Computer Engineering, The Johns Hopkins University, 3400 N. Charles Street, Baltimore, MD 21218, USA.
Opt Express. 2010 May 24;18(11):11772-84. doi: 10.1364/OE.18.011772.
We realized graphics processing unit (GPU) based real-time 4D (3D+time) signal processing and visualization on a regular Fourier-domain optical coherence tomography (FD-OCT) system with a nonlinear k-space spectrometer. An ultra-high speed linear spline interpolation (LSI) method for lambda-to-k spectral re-sampling is implemented in the GPU architecture, which gives average interpolation speeds of >3,000,000 line/s for 1024-pixel OCT (1024-OCT) and >1,400,000 line/s for 2048-pixel OCT (2048-OCT). The complete FD-OCT signal processing including lambda-to-k spectral re-sampling, fast Fourier transform (FFT) and post-FFT processing have all been implemented on a GPU. The maximum complete A-scan processing speeds are investigated to be 680,000 line/s for 1024-OCT and 320,000 line/s for 2048-OCT, which correspond to 1GByte processing bandwidth. In our experiment, a 2048-pixel CMOS camera running up to 70 kHz is used as an acquisition device. Therefore the actual imaging speed is camera- limited to 128,000 line/s for 1024-OCT or 70,000 line/s for 2048-OCT. 3D Data sets are continuously acquired in real time at 1024-OCT mode, immediately processed and visualized as high as 10 volumes/second (12,500 A-scans/volume) by either en face slice extraction or ray-casting based volume rendering from 3D texture mapped in graphics memory. For standard FD-OCT systems, a GPU is the only additional hardware needed to realize this improvement and no optical modification is needed. This technique is highly cost-effective and can be easily integrated into most ultrahigh speed FD-OCT systems to overcome the 3D data processing and visualization bottlenecks.
我们在具有非线性k空间光谱仪的常规傅里叶域光学相干断层扫描(FD-OCT)系统上实现了基于图形处理单元(GPU)的实时4D(3D + 时间)信号处理和可视化。在GPU架构中实现了一种用于λ到k光谱重采样的超高速线性样条插值(LSI)方法,对于1024像素OCT(1024-OCT),其平均插值速度大于3,000,000线/秒,对于2048像素OCT(2048-OCT),平均插值速度大于1,400,000线/秒。包括λ到k光谱重采样、快速傅里叶变换(FFT)和FFT后处理在内的完整FD-OCT信号处理均已在GPU上实现。对于1024-OCT,最大完整A扫描处理速度经研究为680,000线/秒,对于2048-OCT为320,000线/秒,这对应于1GB的处理带宽。在我们的实验中,使用运行频率高达70kHz的2048像素CMOS相机作为采集设备。因此,对于1024-OCT,实际成像速度受相机限制为128,000线/秒,对于2048-OCT为70,000线/秒。在1024-OCT模式下实时连续采集3D数据集,通过从图形内存中映射的3D纹理进行表面切片提取或基于光线投射的体绘制,立即进行处理并以高达10体积/秒(12,500 A扫描/体积)的速度进行可视化。对于标准FD-OCT系统,GPU是实现这一改进所需的唯一额外硬件,无需进行光学修改。该技术具有很高的成本效益,并且可以轻松集成到大多数超高速FD-OCT系统中,以克服3D数据处理和可视化瓶颈。