Huang Chen-Hsiu, Wu Ja-Ling
Department of Computer Science and Information Engineering, National Taiwan University, Taipei 106, Taiwan.
Entropy (Basel). 2024 Apr 24;26(5):357. doi: 10.3390/e26050357.
End-to-end learned image compression codecs have notably emerged in recent years. These codecs have demonstrated superiority over conventional methods, showcasing remarkable flexibility and adaptability across diverse data domains while supporting new distortion losses. Despite challenges such as computational complexity, learned image compression methods inherently align with learning-based data processing and analytic pipelines due to their well-suited internal representations. The concept of Video Coding for Machines has garnered significant attention from both academic researchers and industry practitioners. This concept reflects the growing need to integrate data compression with computer vision applications. In light of these developments, we present a comprehensive survey and review of lossy image compression methods. Additionally, we provide a concise overview of two prominent international standards, MPEG Video Coding for Machines and JPEG AI. These standards are designed to bridge the gap between data compression and computer vision, catering to practical industry use cases.
近年来,端到端学习的图像压缩编解码器显著涌现。这些编解码器已证明优于传统方法,在支持新的失真损失的同时,在不同数据领域展现出显著的灵活性和适应性。尽管存在计算复杂度等挑战,但由于其内部表示非常合适,基于学习的图像压缩方法本质上与基于学习的数据处理和分析管道相契合。机器视频编码的概念已引起学术研究人员和行业从业者的广泛关注。这一概念反映了将数据压缩与计算机视觉应用集成的需求不断增长。鉴于这些发展,我们对有损图像压缩方法进行了全面的调查和综述。此外,我们简要概述了两个重要的国际标准,即机器的MPEG视频编码和JPEG AI。这些标准旨在弥合数据压缩与计算机视觉之间的差距,以满足实际行业用例的需求。