• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

端到端优化的感兴趣区域图像压缩

End-to-end Optimized ROI Image Compression.

作者信息

Cai Chunlei, Chen Li, Zhang Xiaoyun, Gao Zhiyong

出版信息

IEEE Trans Image Process. 2019 Dec 25. doi: 10.1109/TIP.2019.2960869.

DOI:10.1109/TIP.2019.2960869
PMID:31880554
Abstract

Compressing an image with more bits automatically allocated to the region of interest (ROI) than to the background can both protect key information and reduce substantial redundancy. This paper models ROI image compression as an optimization problem of minimizing a weighted sum of the rate of the image and distortion of the ROI. The traditional framework solves this problem by cascading ROI prediction and ROI coding, through which achieving the optimized solution is impossible. To improve coding performance, we propose a novel deep-learning-based unified framework that can achieve rate distortion optimization for ROI compression. Specifically, the proposed framework includes a pair of ROI encoder and decoder convolutional neural networks and a learned entropy codec. The encoder network simultaneously generates multiscale representations that support efficient rate allocation and an implicit ROI mask that guides rate allocation. The proposed framework can automatically complete ROI image compression, and it can be optimized from data in an end-to-end manner. To effectively train the framework by back propagation, we develop a soft-to-hard ROI prediction scheme to make the entire framework differential. To improve visual quality, we propose a hierarchical distortion loss function to protect both pixel-level fidelity for ROI and structural similarity for the entire image. The proposed framework is implemented in two scenarios: salient-target and face-target ROI compression. Comparative experiments demonstrate the advantages of the proposed framework over the traditional framework, including considerably better subjective visual quality, significantly higher objective ROI compression performance and execution efficiency.

摘要

对分配给感兴趣区域(ROI)的比特数多于背景的图像进行压缩,既能保护关键信息,又能减少大量冗余。本文将ROI图像压缩建模为一个优化问题,即最小化图像比特率和ROI失真的加权和。传统框架通过级联ROI预测和ROI编码来解决这个问题,但这样无法实现最优解。为了提高编码性能,我们提出了一种基于深度学习的新型统一框架,该框架可以实现ROI压缩的率失真优化。具体来说,所提出的框架包括一对ROI编码器和解码器卷积神经网络以及一个学习到的熵编码解码器。编码器网络同时生成支持高效比特率分配的多尺度表示和指导比特率分配的隐式ROI掩码。所提出的框架可以自动完成ROI图像压缩,并且可以以端到端的方式从数据中进行优化。为了通过反向传播有效地训练该框架,我们开发了一种从软到硬的ROI预测方案,以使整个框架具有可微性。为了提高视觉质量,我们提出了一种分层失真损失函数,以保护ROI的像素级保真度和整个图像的结构相似性。所提出的框架在两种场景中实现:显著目标和面部目标ROI压缩。对比实验证明了所提出框架相对于传统框架的优势,包括明显更好的主观视觉质量、显著更高的客观ROI压缩性能和执行效率。

相似文献

1
End-to-end Optimized ROI Image Compression.端到端优化的感兴趣区域图像压缩
IEEE Trans Image Process. 2019 Dec 25. doi: 10.1109/TIP.2019.2960869.
2
An End-to-End Learning Framework for Video Compression.一种用于视频压缩的端到端学习框架。
IEEE Trans Pattern Anal Mach Intell. 2021 Oct;43(10):3292-3308. doi: 10.1109/TPAMI.2020.2988453. Epub 2021 Sep 2.
3
Enhanced Standard Compatible Image Compression Framework Based on Auxiliary Codec Networks.基于辅助编解码器网络的增强型标准兼容图像压缩框架
IEEE Trans Image Process. 2022;31:664-677. doi: 10.1109/TIP.2021.3134473. Epub 2021 Dec 28.
4
Image Compression Based on Hybrid Domain Attention and Postprocessing Enhancement.基于混合域注意力和后处理增强的图像压缩。
Comput Intell Neurosci. 2022 Mar 17;2022:4926124. doi: 10.1155/2022/4926124. eCollection 2022.
5
Learning Content-Weighted Deep Image Compression.学习内容加权深度图像压缩
IEEE Trans Pattern Anal Mach Intell. 2021 Oct;43(10):3446-3461. doi: 10.1109/TPAMI.2020.2983926. Epub 2021 Sep 2.
6
End-to-End Rate-Distortion Optimized Learned Hierarchical Bi-Directional Video Compression.端到端率失真优化的学习分层双向视频压缩
IEEE Trans Image Process. 2022;31:974-983. doi: 10.1109/TIP.2021.3138300. Epub 2022 Jan 6.
7
Exploiting Intra-Slice and Inter-Slice Redundancy for Learning-Based Lossless Volumetric Image Compression.利用切片内和切片间冗余进行基于学习的无损体积图像压缩。
IEEE Trans Image Process. 2022;31:1697-1707. doi: 10.1109/TIP.2022.3140608. Epub 2022 Feb 7.
8
Semantic Perceptual Image Compression With a Laplacian Pyramid of Convolutional Networks.基于卷积网络拉普拉斯金字塔的语义感知图像压缩
IEEE Trans Image Process. 2021;30:4225-4237. doi: 10.1109/TIP.2021.3065244. Epub 2021 Apr 12.
9
l2 Restoration of l∞-decoded images via soft-decision estimation.通过软判决估计恢复 l∞ 解码图像。
IEEE Trans Image Process. 2012 Dec;21(12):4797-807. doi: 10.1109/TIP.2012.2202672. Epub 2012 Jun 5.
10
A Game Theory Based CTU-Level Bit Allocation Scheme for HEVC Region of Interest Coding.一种基于博弈论的用于高效视频编码(HEVC)感兴趣区域编码的编码树单元(CTU)级比特分配方案
IEEE Trans Image Process. 2021;30:794-805. doi: 10.1109/TIP.2020.3038515. Epub 2020 Dec 4.

引用本文的文献

1
Improved Perceptual Quality of Traffic Signs and Lights for the Teleoperation of Autonomous Vehicle Remote Driving via Multi-Category Region of Interest Video Compression.通过多类别感兴趣区域视频压缩提升自动驾驶远程驾驶中交通标志和信号灯的感知质量
Entropy (Basel). 2025 Jun 24;27(7):674. doi: 10.3390/e27070674.
2
Hybrid deep learning architecture for scalable and high-quality image compression.用于可扩展和高质量图像压缩的混合深度学习架构。
Sci Rep. 2025 Jul 2;15(1):22926. doi: 10.1038/s41598-025-06481-0.
3
SEB-YOLO: An Improved YOLOv5 Model for Remote Sensing Small Target Detection.
SEB-YOLO:一种用于遥感小目标检测的改进YOLOv5模型。
Sensors (Basel). 2024 Mar 29;24(7):2193. doi: 10.3390/s24072193.
4
Medical image fusion quality assessment based on conditional generative adversarial network.基于条件生成对抗网络的医学图像融合质量评估
Front Neurosci. 2022 Aug 9;16:986153. doi: 10.3389/fnins.2022.986153. eCollection 2022.