Chen Chenglizhao, Wang Guotao, Peng Chong, Zhang Xiaowei, Qin Hong
IEEE Trans Image Process. 2019 Aug 23. doi: 10.1109/TIP.2019.2934350.
This paper proposes to utilize supervised deep convolutional neural networks to take full advantage of the long-term spatial-temporal information in order to improve the video saliency detection performance. The conventional methods, which use the temporally neighbored frames solely, could easily encounter transient failure cases when the spatial-temporal saliency clues are less-trustworthy for a long period. To tackle the aforementioned limitation, we plan to identify those beyond-scope frames with trustworthy long-term saliency clues first and then align it with the current problem domain for an improved video saliency detection.
本文提出利用有监督的深度卷积神经网络来充分利用长期的时空信息,以提高视频显著性检测性能。传统方法仅使用时间上相邻的帧,当长期时空显著性线索在很长一段时间内不太可靠时,很容易遇到瞬态失败的情况。为了解决上述局限性,我们计划首先识别那些具有可靠长期显著性线索的超出范围的帧,然后将其与当前问题域对齐,以改进视频显著性检测。