Designing Interpretable Recurrent Neural Networks for Video Reconstruction via Deep Unfolding.

Publication information

IEEE Trans Image Process. 2021;30:4099-4113. doi: 10.1109/TIP.2021.3069296. Epub 2021 Apr 8.

DOI:10.1109/TIP.2021.3069296

Abstract

Deep unfolding methods design deep neural networks as learned variations of optimization algorithms by unrolling their iterations. These networks have been shown to achieve faster convergence and higher accuracy than the original optimization methods. In this line of research, this paper presents novel interpretable deep recurrent neural networks (RNNs), designed by unfolding iterative algorithms that solve the task of sequential signal reconstruction (in particular, video reconstruction). The proposed networks are designed by taking into account that patches of video frames have a sparse representation and that the temporal difference between consecutive representations is also sparse. Specifically, we design an interpretable deep RNN (coined reweighted-RNN) by unrolling the iterations of a proximal method that solves a reweighted version of the ℓ1-ℓ1 minimization problem. Due to the underlying minimization model, our reweighted-RNN has a different thresholding function (that is, a different activation function) for each hidden unit in each layer. In this way, it has higher network expressivity than existing deep unfolding RNN models. We also present the derivative ℓ1-ℓ1-RNN model, which is obtained by unfolding a proximal method for the ℓ1-ℓ1 minimization problem. We apply the proposed interpretable RNNs to the task of video frame reconstruction from low-dimensional measurements, that is, sequential video frame reconstruction. The experimental results on various datasets demonstrate that the proposed deep RNNs outperform various RNN models.
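To make the unfolding idea concrete, the following is a minimal sketch and not the authors' implementation. It assumes the commonly used form of the ℓ1-ℓ1 problem, min over h of 0.5·||x − M h||² + λ1·||h||₁ + λ2·||h − side||₁, where `M` (measurement matrix times dictionary), `side` (a prediction carried over from the previous frame's representation), `lam1`, `lam2`, and all function names are hypothetical choices made for illustration. Each loop iteration plays the role of one network layer: a linear step followed by an elementwise thresholding function that acts as the activation. In a learned network the matrices and thresholds would be trainable; here they are fixed.

```python
import numpy as np

def prox_l1_l1(v, s, lam1, lam2):
    """Elementwise proximal operator of u -> lam1*|u| + lam2*|u - s| at v.

    This piecewise-linear function is the 'activation' of the unfolded layer.
    """
    # Reduce to the case s >= 0 using the symmetry prox(v; s) = -prox(-v; -s).
    sgn = np.where(s >= 0, 1.0, -1.0)
    v, s = sgn * v, sgn * s
    u = (np.clip(v - lam1 + lam2, 0.0, s)           # plateau at 0, ramp, plateau at s
         + np.maximum(v - s - lam1 - lam2, 0.0)     # linear branch above s
         + np.minimum(v + lam1 + lam2, 0.0))        # linear branch below 0
    return sgn * u

def objective(h, x, M, side, lam1, lam2):
    """Assumed l1-l1 objective: data fit + sparsity + sparsity of the difference."""
    r = x - M @ h
    return 0.5 * r @ r + lam1 * np.abs(h).sum() + lam2 * np.abs(h - side).sum()

def unfolded_l1_l1(x, M, side, lam1, lam2, num_layers=50):
    """Run num_layers proximal-gradient iterations, one per 'layer'."""
    c = np.linalg.norm(M, 2) ** 2       # Lipschitz constant of the data-fit gradient
    h = np.zeros(M.shape[1])
    for _ in range(num_layers):
        grad = M.T @ (M @ h - x)        # gradient of 0.5 * ||x - M h||^2
        h = prox_l1_l1(h - grad / c, side, lam1 / c, lam2 / c)
    return h

# Toy example: the current representation differs from `side` in two entries.
rng = np.random.default_rng(0)
M = rng.standard_normal((20, 50))
side = np.zeros(50)
side[[3, 17, 31]] = [1.0, -2.0, 1.5]
h_true = side.copy()
h_true[17] = -1.0
h_true[40] = 0.8
x = M @ h_true                          # low-dimensional measurements
h_hat = unfolded_l1_l1(x, M, side, lam1=0.05, lam2=0.05)
print(objective(np.zeros(50), x, M, side, 0.05, 0.05),
      objective(h_hat, x, M, side, 0.05, 0.05))
```

When `side` is zero the activation reduces to ordinary soft thresholding with threshold `lam1 + lam2`. A reweighted variant, as the abstract describes, would give each hidden unit its own threshold, which could be sketched here by passing arrays instead of scalars for `lam1` and `lam2`.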


Similar articles

Sparse signal reconstruction via recurrent neural networks with hyperbolic tangent function.

Neural Netw. 2022 Sep;153:1-12. doi: 10.1016/j.neunet.2022.05.022. Epub 2022 Jun 2.

Interpretable Neural Networks for Video Separation: Deep Unfolding RPCA With Foreground Masking.

IEEE Trans Image Process. 2024;33:108-122. doi: 10.1109/TIP.2023.3336176. Epub 2023 Dec 8.

Assessment of Automated Identification of Phases in Videos of Cataract Surgery Using Machine Learning and Deep Learning Techniques.

JAMA Netw Open. 2019 Apr 5;2(4):e191860. doi: 10.1001/jamanetworkopen.2019.1860.

Hyperspectral Image Features Classification Using Deep Learning Recurrent Neural Networks.

J Med Syst. 2019 Jun 4;43(7):216. doi: 10.1007/s10916-019-1347-9.

Spatial-Temporal Recurrent Neural Network for Emotion Recognition.

IEEE Trans Cybern. 2019 Mar;49(3):839-847. doi: 10.1109/TCYB.2017.2788081. Epub 2018 Jan 30.

Monitoring tool usage in surgery videos using boosted convolutional and recurrent neural networks.

Med Image Anal. 2018 Jul;47:203-218. doi: 10.1016/j.media.2018.05.001. Epub 2018 May 9.

Learning With Interpretable Structure From Gated RNN.

IEEE Trans Neural Netw Learn Syst. 2020 Jul;31(7):2267-2279. doi: 10.1109/TNNLS.2020.2967051. Epub 2020 Feb 13.

Generalized Recurrent Neural Network accommodating Dynamic Causal Modeling for functional MRI analysis.

Neuroimage. 2018 Sep;178:385-402. doi: 10.1016/j.neuroimage.2018.05.042. Epub 2018 May 18.

Convolutional Recurrent Neural Networks for Dynamic MR Image Reconstruction.

IEEE Trans Med Imaging. 2019 Jan;38(1):280-290. doi: 10.1109/TMI.2018.2863670. Epub 2018 Aug 6.
