

A Spatial-Temporal Recurrent Neural Network for Video Saliency Prediction.

Authors

Zhang Kao, Chen Zhenzhong, Liu Shan

Publication

IEEE Trans Image Process. 2021;30:572-587. doi: 10.1109/TIP.2020.3036749. Epub 2020 Nov 24.

Abstract

In this paper, a recurrent neural network is designed for video saliency prediction considering spatial-temporal features. In our work, video frames are routed through the static network for spatial features and the dynamic network for temporal features. For the spatial-temporal feature integration, a novel select and re-weight fusion model is proposed which can learn and adjust the fusion weights based on the spatial and temporal features in different scenes automatically. Finally, an attention-aware convolutional long short term memory (ConvLSTM) network is developed to predict salient regions based on the features extracted from consecutive frames and generate the ultimate saliency map for each video frame. The proposed method is compared with state-of-the-art saliency models on five public video saliency benchmark datasets. The experimental results demonstrate that our model can achieve advanced performance on video saliency prediction.
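The select-and-re-weight fusion step described above can be sketched roughly as follows. This is an illustrative NumPy sketch, not the authors' implementation: the specific gating (global average pooling of each stream, then a softmax producing one weight per stream) and the parameter matrix `w` are assumptions standing in for the learned fusion module.

```python
import numpy as np

def select_reweight_fuse(spatial, temporal, w):
    """Fuse spatial and temporal feature maps (C, H, W) with learned weights.

    Hypothetical gate: pool each stream to a channel descriptor, map the
    concatenated descriptors to two logits with `w`, softmax them, and
    use the results as per-stream fusion weights.
    """
    s_desc = spatial.mean(axis=(1, 2))              # (C,) descriptor of spatial stream
    t_desc = temporal.mean(axis=(1, 2))             # (C,) descriptor of temporal stream
    logits = w @ np.concatenate([s_desc, t_desc])   # (2,) one logit per stream
    e = np.exp(logits - logits.max())
    alpha = e / e.sum()                             # fusion weights, sum to 1
    return alpha[0] * spatial + alpha[1] * temporal

# Toy usage with random features in place of CNN outputs.
rng = np.random.default_rng(0)
C, H, W = 4, 8, 8
spatial = rng.standard_normal((C, H, W))
temporal = rng.standard_normal((C, H, W))
w = rng.standard_normal((2, 2 * C))                 # stands in for learned parameters
fused = select_reweight_fuse(spatial, temporal, w)
```

Because the weights form a convex combination, the fused map stays on the same scale as its inputs regardless of which stream dominates in a given scene.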

