Information Technologies and Programming Faculty, ITMO University, Kronverksky Pr. 49, bldg. A, St. Petersburg 197101, Russia.
Sensors (Basel). 2023 Jan 26;23(3):1368. doi: 10.3390/s23031368.
This paper is dedicated to video coding based on a compressive sensing (CS) framework. In CS, it is assumed that if a video sequence is sparse in some transform domain, then it could be reconstructed from a much lower number of samples (called measurements) than the Nyquist-Shannon theorem requires. Here, the performance of such a codec depends on how the measurements are acquired (or sensed) and compressed and how the video is reconstructed from the decoded measurements. Here, such a codec potentially could provide significantly faster encoding compared with traditional block-based intra-frame encoding via Motion JPEG (MJPEG), H.264/AVC or H.265/HEVC standards. However, existing video codecs based on CS are inferior to the traditional codecs in rate distortion performance, which makes them useless in practical scenarios. In this paper, we present a video codec based on CS called CS-JPEG. To the author's knowledge, CS-JPEG is the first codec based on CS, combining fast encoding and high rate distortion results. Our performance evaluation shows that, compared with the optimized software implementations of MJPEG, H.264/AVC, and H.265/HEVC, the proposed CS-JPEG encoding is 2.2, 1.9, and 30.5 times faster, providing 2.33, 0.79, and 1.45 dB improvements in the peak signal-to-noise ratio, respectively. Therefore, it could be more attractive for video applications having critical limitations in computational resources or a battery lifetime of an upstreaming device.
本文致力于基于压缩感知 (CS) 框架的视频编码。在 CS 中,假设如果视频序列在某些变换域中是稀疏的,那么它可以从比奈奎斯特-香农定理要求的更低数量的样本(称为测量值)中重建。在这里,这种编解码器的性能取决于测量值是如何获取(或感知)和压缩的,以及视频是如何从解码后的测量值中重建的。在这里,与传统的基于块的帧内编码(通过 Motion JPEG (MJPEG)、H.264/AVC 或 H.265/HEVC 标准)相比,这种编解码器有可能提供显著更快的编码速度。然而,现有的基于 CS 的视频编解码器在率失真性能方面劣于传统编解码器,这使得它们在实际场景中无用。在本文中,我们提出了一种称为 CS-JPEG 的基于 CS 的视频编解码器。据作者所知,CS-JPEG 是第一个基于 CS 的编解码器,它结合了快速编码和高率失真性能。我们的性能评估表明,与 MJPEG、H.264/AVC 和 H.265/HEVC 的优化软件实现相比,所提出的 CS-JPEG 编码分别快 2.2、1.9 和 30.5 倍,分别提供 2.33、0.79 和 1.45 dB 的峰值信噪比提高。因此,对于在计算资源或上行设备电池寿命方面存在关键限制的视频应用来说,它可能更具吸引力。