Basoglu C, Kim Y, Chalana V
Department of Electrical Engineering, University of Washington, Seattle 98195-2500, USA.
Ultrason Imaging. 1996 Oct;18(4):241-60. doi: 10.1177/016173469601800402.
We have developed a new ultrasound scan conversion algorithm that can be executed very efficiently on modern microprocessors. Our algorithm is designed to handle the address calculations and input and output (I/O) data loading concurrently with the interpolation. The processing unit's computing power can be dedicated to performing pixel interpolations while the other operations are handled by an independent direct memory access (DMA) controller. By making intelligent use of the I/O transfer capabilities of the DMA controller, the algorithm avoids spending the processing unit's valuable computing cycles in address calculations and nonactive pixel blanking. Furthermore, the new approach speeds up the computation by utilizing the ability of superscalar and very long instruction word (VLIW) processors to perform multiple operations in parallel. Our scan conversion algorithm was implemented on a multimedia and imaging system based on the Texas Instruments TMS320C80 Multimedia Video Processor (MVP). Computing cycles are spent only on predeterminable nonzero output pixels. For example, an execution time of 11.4 ms was achieved when there are 101,829 nonzero output pixels. This algorithm demonstrates a substantial improvement over previous scan conversion algorithms, and its optimized implementation enables modern commercially available programmable processors to support scan conversion at video rates.
我们开发了一种新的超声扫描转换算法,该算法能够在现代微处理器上高效执行。我们的算法旨在将地址计算以及输入输出(I/O)数据加载与插值操作同时进行处理。处理单元的计算能力可专门用于执行像素插值,而其他操作则由独立的直接内存访问(DMA)控制器处理。通过巧妙利用DMA控制器的I/O传输能力,该算法避免了在地址计算和非有效像素消隐上消耗处理单元宝贵的计算周期。此外,这种新方法通过利用超标量和超长指令字(VLIW)处理器并行执行多个操作的能力来加速计算。我们的扫描转换算法在基于德州仪器TMS320C80多媒体视频处理器(MVP)的多媒体和成像系统上得以实现。计算周期仅花费在可预先确定的非零输出像素上。例如,当有101,829个非零输出像素时,执行时间达到了11.4毫秒。该算法相较于先前的扫描转换算法有显著改进,其优化实现使现代商用可编程处理器能够支持视频速率下的扫描转换。