Centre for Multimedia and Network Technology, School of Computer Engineering, Nanyang Technological University, 639798, Singapore.
IEEE Trans Image Process. 2013 Aug;22(8):3120-32. doi: 10.1109/TIP.2013.2259837.
Multimedia applications such as image or video retrieval, copy detection, and so forth can benefit from saliency detection, which is essentially a method to identify areas in images and videos that capture the attention of the human visual system. In this paper, we propose a new spatio-temporal saliency detection framework on the basis of regularized feature reconstruction. Specifically, for video saliency detection, both the temporal and spatial saliency detection are considered. For temporal saliency, we model the movement of the target patch as a reconstruction process using the patches in neighboring frames. A Laplacian smoothing term is introduced to model the coherent motion trajectories. With psychological findings that abrupt stimulus could cause a rapid and involuntary deployment of attention, our temporal model combines the reconstruction error, regularizer, and local trajectory contrast to measure the temporal saliency. For spatial saliency, a similar sparse reconstruction process is adopted to capture the regions with high center-surround contrast. Finally, the temporal saliency and spatial saliency are combined together to favor salient regions with high confidence for video saliency detection. We also apply the spatial saliency part of the spatio-temporal model to image saliency detection. Experimental results on a human fixation video dataset and an image saliency detection dataset show that our method achieves the best performance over several state-of-the-art approaches.
多媒体应用,如图像或视频检索、复制检测等,可以受益于显著检测,这本质上是一种识别图像和视频中捕获人类视觉系统注意力的区域的方法。在本文中,我们提出了一种新的基于正则化特征重建的时空显著检测框架。具体来说,对于视频显著检测,同时考虑了时间和空间显著检测。对于时间显著检测,我们将目标补丁的运动建模为使用相邻帧中的补丁进行重建的过程。引入拉普拉斯平滑项来建模连贯的运动轨迹。根据突然的刺激会引起注意力快速而无意识的转移的心理学发现,我们的时间模型结合了重建误差、正则化项和局部轨迹对比来衡量时间显著检测。对于空间显著检测,采用类似的稀疏重建过程来捕获具有高中心-周围对比度的区域。最后,将时间显著检测和空间显著检测结合起来,有利于对视频显著检测具有高置信度的显著区域。我们还将时空模型的空间显著检测部分应用于图像显著检测。在人类注视视频数据集和图像显著检测数据集上的实验结果表明,我们的方法在几种最先进的方法中取得了最佳性能。