• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种嵌入式显著性图估计方案:应用于视频编码。

An embedded saliency map estimator scheme: application to video encoding.

作者信息

Tsapatsoulis Nicolas, Rapantzikos Konstantinos, Pattichis Constantinos

机构信息

Department of Computer Science, University of Cyprus, CY 1678, Cyprus.

出版信息

Int J Neural Syst. 2007 Aug;17(4):289-304. doi: 10.1142/S0129065707001147.

DOI:10.1142/S0129065707001147
PMID:17696293
Abstract

In this paper we propose a novel saliency-based computational model for visual attention. This model processes both top-down (goal directed) and bottom-up information. Processing in the top-down channel creates the so called skin conspicuity map and emulates the visual search for human faces performed by humans. This is clearly a goal directed task but is generic enough to be context independent. Processing in the bottom-up information channel follows the principles set by Itti et al. but it deviates from them by computing the orientation, intensity and color conspicuity maps within a unified multi-resolution framework based on wavelet subband analysis. In particular, we apply a wavelet based approach for efficient computation of the topographic feature maps. Given that wavelets and multiresolution theory are naturally connected the usage of wavelet decomposition for mimicking the center surround process in humans is an obvious choice. However, our implementation goes further. We utilize the wavelet decomposition for inline computation of the features (such as orientation angles) that are used to create the topographic feature maps. The bottom-up topographic feature maps and the top-down skin conspicuity map are then combined through a sigmoid function to produce the final saliency map. A prototype of the proposed model was realized through the TMDSDMK642-0E DSP platform as an embedded system allowing real-time operation. For evaluation purposes, in terms of perceived visual quality and video compression improvement, a ROI-based video compression setup was followed. Extended experiments concerning both MPEG-1 as well as low bit-rate MPEG-4 video encoding were conducted showing significant improvement in video compression efficiency without perceived deterioration in visual quality.

摘要

在本文中,我们提出了一种新颖的基于显著性的视觉注意力计算模型。该模型同时处理自上而下(目标导向)和自下而上的信息。自上而下通道的处理创建了所谓的皮肤显著图,并模拟了人类执行的对人脸的视觉搜索。这显然是一个目标导向任务,但具有足够的通用性,与上下文无关。自下而上信息通道的处理遵循Itti等人设定的原则,但通过在基于小波子带分析的统一多分辨率框架内计算方向、强度和颜色显著图,与这些原则有所不同。特别是,我们应用基于小波的方法来高效计算地形特征图。鉴于小波和多分辨率理论自然相关,使用小波分解来模仿人类的中心环绕过程是一个明显的选择。然而,我们的实现更进一步。我们利用小波分解对用于创建地形特征图的特征(如方向角)进行在线计算。然后,通过一个Sigmoid函数将自下而上的地形特征图和自上而下的皮肤显著图进行组合,以生成最终的显著图。所提出模型的一个原型通过TMDSDMK642 - 0E DSP平台作为嵌入式系统实现,允许实时操作。为了进行评估,在感知视觉质量和视频压缩改进方面,遵循了基于感兴趣区域(ROI)的视频压缩设置。针对MPEG - 1以及低比特率MPEG - 4视频编码进行了扩展实验,结果表明视频压缩效率有显著提高,且视觉质量没有明显下降。

相似文献

1
An embedded saliency map estimator scheme: application to video encoding.一种嵌入式显著性图估计方案:应用于视频编码。
Int J Neural Syst. 2007 Aug;17(4):289-304. doi: 10.1142/S0129065707001147.
2
A neural network implementation of a saliency map model.一种显著性地图模型的神经网络实现。
Neural Netw. 2006 Dec;19(10):1467-74. doi: 10.1016/j.neunet.2005.12.004. Epub 2006 May 9.
3
Decision-theoretic saliency: computational principles, biological plausibility, and implications for neurophysiology and psychophysics.决策理论显著性:计算原理、生物学合理性及其对神经生理学和心理物理学的影响
Neural Comput. 2009 Jan;21(1):239-71. doi: 10.1162/neco.2009.11-06-391.
4
Predicting visual fixations on video based on low-level visual features.基于低级视觉特征预测视频中的视觉注视点。
Vision Res. 2007 Sep;47(19):2483-98. doi: 10.1016/j.visres.2007.06.015. Epub 2007 Aug 3.
5
A novel multiresolution spatiotemporal saliency detection model and its applications in image and video compression.一种新的多分辨率时空显著检测模型及其在图像和视频压缩中的应用。
IEEE Trans Image Process. 2010 Jan;19(1):185-98. doi: 10.1109/TIP.2009.2030969.
6
Top-down attention based on object representation and incremental memory for knowledge building and inference.基于对象表示和增量记忆的自上而下的注意力,用于知识构建和推理。
Neural Netw. 2013 Oct;46:9-22. doi: 10.1016/j.neunet.2013.04.002. Epub 2013 Apr 8.
7
Modeling eye movements in visual agnosia with a saliency map approach: bottom-up guidance or top-down strategy?基于显著图模型的视觉失认症眼动研究:自下而上引导还是自上而下策略?
Neural Netw. 2011 Aug;24(6):665-77. doi: 10.1016/j.neunet.2011.01.004. Epub 2011 Jan 27.
8
Cue-guided search: a computational model of selective attention.线索引导搜索:选择性注意的一种计算模型。
IEEE Trans Neural Netw. 2005 Jul;16(4):910-24. doi: 10.1109/TNN.2005.851787.
9
Sources of top-down control in visual search.视觉搜索中自上而下控制的来源。
J Cogn Neurosci. 2009 Nov;21(11):2100-13. doi: 10.1162/jocn.2008.21173.
10
Psychophysical tests of the hypothesis of a bottom-up saliency map in primary visual cortex.对初级视觉皮层中自下而上显著图假说的心理物理学测试。
PLoS Comput Biol. 2007 Apr 6;3(4):e62. doi: 10.1371/journal.pcbi.0030062. Epub 2007 Feb 20.

引用本文的文献

1
An efficient hierarchical video coding scheme combining visual perception characteristics.一种结合视觉感知特性的高效分层视频编码方案。
ScientificWorldJournal. 2014;2014:727943. doi: 10.1155/2014/727943. Epub 2014 May 13.