• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

视频对象形状与透明度的高效编码。

Efficient coding of shape and transparency for video objects.

作者信息

Aghito Shankar Manuel, Forchhammer Søren

机构信息

Technical University of Denmark, COM DTU Department of Communications, Optics and Materials, DTU, 2800 Kgs. Lyngby, Denmark.

出版信息

IEEE Trans Image Process. 2007 Sep;16(9):2234-44. doi: 10.1109/tip.2007.903902.

DOI:10.1109/tip.2007.903902
PMID:17784597
Abstract

A novel scheme for coding gray-level alpha planes in object-based video is presented. Gray-level alpha planes convey the shape and the transparency information, which are required for smooth composition of video objects. The algorithm proposed is based on the segmentation of the alpha plane in three layers: binary shape layer, opaque layer, and intermediate layer. Thus, the latter two layers replace the single transparency layer of MPEG-4 Part 2. Different encoding schemes are specifically designed for each layer, utilizing cross-layer correlations to reduce the bit rate. First, the binary shape layer is processed by a novel video shape coder. In intra mode, the DSLSC binary image coder presented in [3] is used. This is extended here with an intermode utilizing temporal redundancies in shape image sequences. Then the opaque layer is compressed by a newly designed scheme which models the strong correlation with the binary shape layer by morphological erosion operations. Finally, three solutions are proposed for coding the intermediate layer. The knowledge of the two previously encoded layers is utilized in order to increase compression efficiency. Experimental results are reported demonstrating that the proposed techniques provide substantial bit rate savings coding shape and transparency when compared to the tools adopted in MPEG-4 Part 2.

摘要

提出了一种用于基于对象的视频中灰度α平面编码的新方案。灰度α平面传达了视频对象平滑合成所需的形状和透明度信息。所提出的算法基于将α平面分割为三层:二进制形状层、不透明层和中间层。因此,后两层取代了MPEG-4第2部分中的单一透明度层。针对每层专门设计了不同的编码方案,利用跨层相关性来降低比特率。首先,通过一种新颖的视频形状编码器处理二进制形状层。在帧内模式下,使用[3]中提出的DSLSC二进制图像编码器。在此通过利用形状图像序列中的时间冗余的帧间模式对其进行扩展。然后,通过一种新设计的方案对不透明层进行压缩,该方案通过形态侵蚀操作对与二进制形状层的强相关性进行建模。最后,提出了三种对中间层进行编码的解决方案。利用先前编码的两层的知识以提高压缩效率。报告的实验结果表明,与MPEG-4第2部分中采用的工具相比,所提出的技术在编码形状和透明度时可大幅节省比特率。

相似文献

1
Efficient coding of shape and transparency for video objects.视频对象形状与透明度的高效编码。
IEEE Trans Image Process. 2007 Sep;16(9):2234-44. doi: 10.1109/tip.2007.903902.
2
Error concealment for shape in MPEG-4 object-based video coding.基于MPEG-4对象的视频编码中形状的错误隐藏
IEEE Trans Image Process. 2005 Apr;14(4):389-96. doi: 10.1109/tip.2004.841197.
3
Adaptive shape and texture intra refreshment schemes for improved error resilience in object-based video coding.用于提高基于对象的视频编码中错误恢复能力的自适应形状和纹理帧内刷新方案。
IEEE Trans Image Process. 2004 May;13(5):662-76. doi: 10.1109/tip.2004.826092.
4
Compound image compression for real-time computer screen image transmission.用于实时计算机屏幕图像传输的复合图像压缩
IEEE Trans Image Process. 2005 Aug;14(8):993-1005. doi: 10.1109/tip.2005.849776.
5
Context-based coding of bilevel images enhanced by digital straight line analysis.基于上下文的二值图像编码,通过数字直线分析增强。
IEEE Trans Image Process. 2006 Aug;15(8):2120-30. doi: 10.1109/tip.2006.875168.
6
Spatial shape error concealment for object-based image and video coding.基于对象的图像和视频编码中的空间形状错误隐藏
IEEE Trans Image Process. 2004 Apr;13(4):586-99. doi: 10.1109/tip.2004.823826.
7
Sampling-based correlation estimation for distributed source coding under rate and complexity constraints.速率和复杂度约束下分布式信源编码的基于采样的相关性估计
IEEE Trans Image Process. 2008 Nov;17(11):2122-37. doi: 10.1109/tip.2008.2004619.
8
Bayesian resolution enhancement of compressed video.压缩视频的贝叶斯分辨率增强
IEEE Trans Image Process. 2004 Jul;13(7):898-911. doi: 10.1109/tip.2004.827230.
9
Blind MPEG-2 video watermarking robust against geometric attacks: a set of approaches in DCT domain.针对几何攻击具有鲁棒性的盲MPEG-2视频水印:离散余弦变换(DCT)域中的一组方法。
IEEE Trans Image Process. 2006 Jun;15(6):1536-43. doi: 10.1109/tip.2006.873476.
10
Mutual information-based analysis of JPEG2000 contexts.基于互信息的JPEG2000上下文分析。
IEEE Trans Image Process. 2005 Apr;14(4):411-22. doi: 10.1109/tip.2004.841199.

引用本文的文献

1
Machine Vision Methods, Natural Language Processing, and Machine Learning Algorithms for Automated Dispersion Plot Analysis and Chemical Identification from Complex Mixtures.机器视觉方法、自然语言处理和机器学习算法在复杂混合物的自动散布图分析和化学识别中的应用。
Anal Chem. 2019 Aug 20;91(16):10509-10517. doi: 10.1021/acs.analchem.9b01428. Epub 2019 Jul 29.