用于监控视频摘要的人类视觉敏感特征自适应融合

Adaptive fusion of human visual sensitive features for surveillance video summarization.

作者信息

Salehin Md Musfequs, Paul Manoranjan

出版信息

J Opt Soc Am A Opt Image Sci Vis. 2017 May 1;34(5):814-826. doi: 10.1364/JOSAA.34.000814.

DOI:10.1364/JOSAA.34.000814

Abstract

Surveillance video cameras capture large amounts of continuous video streams every day. To analyze or investigate any significant events, it is a laborious and boring job to identify these events from the huge video data if it is done manually. Existing approaches sometimes neglect key frames with significant visual contents and/or select some unimportant frames with low/no activity. To solve this problem, in this paper, a video summarization technique is proposed by combining three multimodal human visual sensitive features, such as foreground objects, motion information, and visual saliency. In a video stream, foreground objects are one of the most important pieces of a video as they contain more detailed information and play a major role in important events. Moreover, motion is another stimulus of a video that significantly attracts human visual attention. To obtain this, motion information is calculated in the spatial domain as well as the frequency domain. Spatial motion information can select object motion accurately; however, it is sensitive to illumination changes. On the other hand, frequency motion information is robust to illumination change, although it is easily affected by noise. Therefore, motion information in both the spatial and the frequency domains is employed. Furthermore, the visual attention cue is a sensitive feature to measure the indication of a user's attraction label for determining key frames. As these features individually cannot perform very well, they are combined to obtain better results. For this purpose, an adaptive linear weighted fusion scheme is proposed to combine the features to rank video frames for summarization. Experimental results reveal that the proposed method outperforms the state-of-the-art methods.

摘要

监控摄像机每天都会捕捉大量的连续视频流。要分析或调查任何重大事件，如果手动从海量视频数据中识别这些事件，都是一项费力且枯燥的工作。现有方法有时会忽略具有重要视觉内容的关键帧，并且/或者选择一些没有活动或活动较少的不重要帧。为了解决这个问题，本文提出了一种视频摘要技术，该技术结合了前景对象、运动信息和视觉显著性这三个多模态人类视觉敏感特征。在视频流中，前景对象是视频中最重要的部分之一，因为它们包含更详细的信息，并且在重要事件中起主要作用。此外，运动是视频的另一种刺激因素，能显著吸引人类的视觉注意力。为了获取运动信息，在空间域和频率域都进行了计算。空间运动信息可以准确地选择对象运动；然而，它对光照变化很敏感。另一方面，频率运动信息对光照变化具有鲁棒性，尽管它很容易受到噪声的影响。因此，采用了空间域和频率域的运动信息。此外，视觉注意力线索是一种敏感特征，用于测量用户的吸引标签指示以确定关键帧。由于这些特征单独表现不佳，因此将它们组合起来以获得更好的结果。为此，提出了一种自适应线性加权融合方案来组合这些特征，以便对视频帧进行排序以进行摘要。实验结果表明，所提出的方法优于现有方法。

相似文献

Adaptive fusion of human visual sensitive features for surveillance video summarization.用于监控视频摘要的人类视觉敏感特征自适应融合

J Opt Soc Am A Opt Image Sci Vis. 2017 May 1;34(5):814-826. doi: 10.1364/JOSAA.34.000814.

MRT letter: visual attention driven framework for hysteroscopy video abstraction.MRT 信：用于宫腔镜视频抽象的视觉注意驱动框架。

Microsc Res Tech. 2013 Jun;76(6):559-63. doi: 10.1002/jemt.22205. Epub 2013 Mar 30.

Scalable gastroscopic video summarization via similar-inhibition dictionary selection.通过相似抑制字典选择实现可扩展的胃镜视频摘要

Artif Intell Med. 2016 Jan;66:1-13. doi: 10.1016/j.artmed.2015.08.006. Epub 2015 Aug 18.

Video saliency incorporating spatiotemporal cues and uncertainty weighting.视频显著度融合时空线索和不确定性加权。

IEEE Trans Image Process. 2014 Sep;23(9):3910-21. doi: 10.1109/TIP.2014.2336549. Epub 2014 Jul 16.

Regularized feature reconstruction for spatio-temporal saliency detection.正则化特征重构的时空显著检测。

IEEE Trans Image Process. 2013 Aug;22(8):3120-32. doi: 10.1109/TIP.2013.2259837.

Motion segmentation and depth ordering using an occlusion detector.使用遮挡检测器进行运动分割和深度排序。

IEEE Trans Pattern Anal Mach Intell. 2008 Jul;30(7):1171-85. doi: 10.1109/TPAMI.2007.70766.

Keyframe extraction from laparoscopic videos based on visual saliency detection.基于视觉显著性检测的腹腔镜视频关键帧提取。

Comput Methods Programs Biomed. 2018 Oct;165:13-23. doi: 10.1016/j.cmpb.2018.07.004. Epub 2018 Jul 18.

Interactive exploration of surveillance video through action shot summarization and trajectory visualization.通过动作镜头摘要和轨迹可视化进行监控视频的交互式探索。

IEEE Trans Vis Comput Graph. 2013 Dec;19(12):2119-28. doi: 10.1109/TVCG.2013.168.

Robust global motion estimation oriented to video object segmentation.面向视频对象分割的鲁棒全局运动估计

IEEE Trans Image Process. 2008 Jun;17(6):958-67. doi: 10.1109/TIP.2008.921985.

Video summarization using line segments, angles and conic parts.使用线段、角度和圆锥曲线部分的视频摘要。

PLoS One. 2017 Nov 9;12(11):e0181636. doi: 10.1371/journal.pone.0181636. eCollection 2017.

用于监控视频摘要的人类视觉敏感特征自适应融合

Adaptive fusion of human visual sensitive features for surveillance video summarization.

作者信息

Salehin Md Musfequs, Paul Manoranjan

出版信息

J Opt Soc Am A Opt Image Sci Vis. 2017 May 1;34(5):814-826. doi: 10.1364/JOSAA.34.000814.

DOI:10.1364/JOSAA.34.000814

PMID:28463326

Abstract

摘要

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

用于监控视频摘要的人类视觉敏感特征自适应融合

Adaptive fusion of human visual sensitive features for surveillance video summarization.

作者信息

出版信息

相似文献

用于监控视频摘要的人类视觉敏感特征自适应融合

Adaptive fusion of human visual sensitive features for surveillance video summarization.

作者信息

出版信息

相似文献