Suppr超能文献

用于离散小波变换的精确感知自量化硬件架构。

Precision-aware self-quantizing hardware architectures for the discrete wavelet transform.

机构信息

Mojix Inc., Los Angeles, CA 90025, USA.

出版信息

IEEE Trans Image Process. 2012 Feb;21(2):768-77. doi: 10.1109/TIP.2011.2163519. Epub 2011 Aug 4.

Abstract

This paper presents designs for both bit-parallel (BP) and digit-serial (DS) precision-optimized implementations of the discrete wavelet transform (DWT), with specific consideration given to the impact of depth (the number of levels of DWT) on the overall computational accuracy. These methods thus allow customizing the precision of a multilevel DWT to a given error tolerance requirement and ensuring an energy-minimal implementation, which increases the applicability of DWT-based algorithms such as JPEG 2000 to energy-constrained platforms and environments. Additionally, quantization of DWT coefficients to a specific target step size is performed as an inherent part of the DWT computation, thereby eliminating the need to have a separate downstream quantization step in applications such as JPEG 2000. Experimental measurements of design performance in terms of area, speed, and power for 90-nm complementary metal-oxide-semiconductor implementation are presented. Results indicate that while BP designs exhibit inherent speed advantages, DS designs require significantly fewer hardware resources with increasing precision and DWT level. A four-level DWT with medium precision, for example, while the BP design is four times faster than the digital-serial design, occupies twice the area. In addition to the BP and DS designs, a novel flexible DWT processor is presented, which supports run-time configurable DWT parameters.

摘要

本文提出了两种位并行 (BP) 和数字串行 (DS) 精度优化的离散小波变换 (DWT) 设计,特别考虑了深度 (DWT 的级数) 对整体计算精度的影响。这些方法允许根据给定的误差容限要求自定义多级 DWT 的精度,并确保实现能量最小化,从而提高基于 DWT 的算法(如 JPEG 2000)在能量受限平台和环境中的适用性。此外,DWT 系数的量化到特定的目标步长作为 DWT 计算的固有部分执行,从而消除了在 JPEG 2000 等应用中需要单独的下游量化步骤的需求。针对 90nm 互补金属氧化物半导体实现,以面积、速度和功率为指标,对设计性能进行了实验测量。结果表明,虽然 BP 设计具有固有的速度优势,但随着精度和 DWT 级别的增加,DS 设计需要的硬件资源要少得多。例如,四级中精度 DWT,BP 设计比数字串行设计快四倍,但占用的面积却是后者的两倍。除了 BP 和 DS 设计,本文还提出了一种新颖的灵活 DWT 处理器,它支持运行时可配置的 DWT 参数。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验