Signal Process. Res. Dept., AT&T Bell Labs., Murray Hill, NJ.
IEEE Trans Image Process. 1995;4(2):125-39. doi: 10.1109/83.342187.
We describe and show the results of video coding based on a three-dimensional (3-D) spatio-temporal subband decomposition. The results include a 1-Mbps coder based on a new adaptive differential pulse code modulation scheme (ADPCM) and adaptive bit allocation. This rate is useful for video storage on CD-ROM. Coding results are also shown for a 384-kbps rate that are based on ADPCM for the lowest frequency band and a new form of vector quantization (geometric vector quantization (GVQ)) for the data in the higher frequency bands. GVQ takes advantage of the inherent structure and sparseness of the data in the higher bands. Results are also shown for a 128-kbps coder that is based on an unbalanced tree-structured vector quantizer (UTSVQ) for the lowest frequency band and GVQ for the higher frequency bands. The results are competitive with traditional video coding techniques and provide the motivation for investigating the 3-D subband framework for different coding schemes and various applications.
我们描述并展示了基于三维(3-D)时空子带分解的视频编码的结果。结果包括一个基于新自适应差分脉冲编码调制方案(ADPCM)和自适应比特分配的 1Mbps 编码器。该速率可用于 CD-ROM 上的视频存储。还展示了用于 384kbps 速率的编码结果,该速率基于最低频带的 ADPCM 和较高频带的一种新形式的矢量量化(几何矢量量化(GVQ))。GVQ 利用了较高频带中数据的固有结构和稀疏性。还展示了一个 128kbps 编码器的结果,该编码器基于用于最低频带的不平衡树状矢量量化器(UTSVQ)和用于较高频带的 GVQ。结果与传统的视频编码技术具有竞争力,并为研究不同编码方案和各种应用的 3-D 子带框架提供了动力。