Seo Juyeon, Lee Jaewoo, Lee Juhyun, Ko Hyunsuk
Opt Express. 2023 Jul 17;31(15):24573-24597. doi: 10.1364/OE.494835.
The field of digital holography has been significant developed in recent decades, however, the commercialization of digital holograms is still hindered by the issue of large data sizes. Due to the complex signal characteristics of digital holograms, which are of interferometric nature, traditional codecs are not able to provide satisfactory coding efficiency. Furthermore, in a typical coding scenario, the hologram is encoded and then decoded, leading to a numerical reconstruction via a light wave propagation model. While previous researches have mainly focused on the quality of the decoded hologram, it is the numerical reconstruction that is visible to the viewer, and thus, its quality must also be taken into consideration when designing a coding solution. In this study, the coding performances of existing compression standards, JPEG2000 and HEVC-Intra, are evaluated on a set of digital holograms, then the limitations of these standards are analyzed. Subsequently, we propose a deep learning-based compression network for full-complex holograms that demonstrates superior coding performance when compared to the latest standard codecs such as VVC and JPEG-XL, in addition to JPEG2000 and HEVC. The proposed network incorporates not only the quality of the decoded hologram, but also the quality of the numerical reconstruction as distortion costs for network training. The experimental results validate that the proposed network provides superior objective coding efficiency and better visual quality compared to the existing methods.
近几十年来,数字全息领域得到了显著发展,然而,数字全息图的商业化仍受到大数据量问题的阻碍。由于数字全息图具有干涉性质的复杂信号特征,传统编解码器无法提供令人满意的编码效率。此外,在典型的编码场景中,全息图先被编码然后解码,通过光波传播模型进行数值重建。虽然先前的研究主要集中在解码全息图的质量上,但观众看到的是数值重建结果,因此,在设计编码解决方案时也必须考虑其质量。在本研究中,我们在一组数字全息图上评估了现有压缩标准JPEG2000和HEVC-Intra的编码性能,然后分析了这些标准的局限性。随后,我们提出了一种用于全复数全息图的基于深度学习的压缩网络,与最新的标准编解码器如VVC和JPEG-XL以及JPEG2000和HEVC相比,该网络具有卓越的编码性能。所提出的网络不仅将解码全息图的质量,还将数值重建的质量纳入网络训练的失真代价中。实验结果验证了与现有方法相比,所提出的网络具有卓越的客观编码效率和更好的视觉质量。