• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于视频异常检测的时空预测与重建网络。

Spatio-temporal prediction and reconstruction network for video anomaly detection.

作者信息

Liu Ting, Zhang Chengqing, Niu Xiaodong, Wang Liming

机构信息

State Key Lab for Electronic Testing Technology, North University of China, Taiyuan, 030051, China.

College of Mechatronics Engineering, North University of China, Tai Yuan, 030051, China.

出版信息

PLoS One. 2022 May 26;17(5):e0265564. doi: 10.1371/journal.pone.0265564. eCollection 2022.

DOI:10.1371/journal.pone.0265564
PMID:35617331
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9135234/
Abstract

The existing anomaly detection methods can be divided into two popular models based on reconstruction or future frame prediction. Due to the strong learning capacity, reconstruction approach can hardly generate significant reconstruction errors for anomalies, whereas future frame prediction approach is sensitive to noise in complicated scenarios. Therefore, a solution has been proposed by balancing the merits and demerits of the two models. However, most methods relied on single-scale information to capture spatial features and lacked temporal continuity between the video frames, affecting anomaly detection accuracy. Thus, we propose a novel method to improve anomaly detection performance. Because of the objects of various scales in each video, we select different receptive fields to extract comprehensive spatial features by the hybrid dilated convolution (HDC) module. Meanwhile, the deeper bidirectional convolutional long short-term memory (DB-ConvLSTM) module can remember the temporal information between the consecutive frames. Experiments prove that our method can detect abnormalities in various video scenes more accurately than the state-of-the-art methods in the anomaly-detection task.

摘要

现有的异常检测方法可基于重建或未来帧预测分为两种流行模型。由于强大的学习能力,重建方法很难为异常生成显著的重建误差,而未来帧预测方法在复杂场景中对噪声敏感。因此,通过平衡这两种模型的优缺点提出了一种解决方案。然而,大多数方法依赖单尺度信息来捕捉空间特征,并且视频帧之间缺乏时间连续性,影响了异常检测的准确性。因此,我们提出了一种新颖的方法来提高异常检测性能。由于每个视频中存在各种尺度的对象,我们通过混合扩张卷积(HDC)模块选择不同的感受野来提取综合空间特征。同时,更深层次的双向卷积长短期记忆(DB-ConvLSTM)模块可以记住连续帧之间的时间信息。实验证明,在异常检测任务中,我们的方法比现有最先进的方法能更准确地检测各种视频场景中的异常。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/3c64e1ef65d8/pone.0265564.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/bce4e804d46f/pone.0265564.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/7a56c1dda4da/pone.0265564.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/97db83ef58a9/pone.0265564.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/da11ac0d2f21/pone.0265564.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/8dbd990d6a6c/pone.0265564.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/b1236e99bd9b/pone.0265564.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/2a2160d70c56/pone.0265564.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/5807ead94421/pone.0265564.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/f0311348c756/pone.0265564.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/145be9ce6f25/pone.0265564.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/3c64e1ef65d8/pone.0265564.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/bce4e804d46f/pone.0265564.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/7a56c1dda4da/pone.0265564.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/97db83ef58a9/pone.0265564.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/da11ac0d2f21/pone.0265564.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/8dbd990d6a6c/pone.0265564.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/b1236e99bd9b/pone.0265564.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/2a2160d70c56/pone.0265564.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/5807ead94421/pone.0265564.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/f0311348c756/pone.0265564.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/145be9ce6f25/pone.0265564.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ac4/9135234/3c64e1ef65d8/pone.0265564.g011.jpg

相似文献

1
Spatio-temporal prediction and reconstruction network for video anomaly detection.用于视频异常检测的时空预测与重建网络。
PLoS One. 2022 May 26;17(5):e0265564. doi: 10.1371/journal.pone.0265564. eCollection 2022.
2
Integrated Multiscale Appearance Features and Motion Information Prediction Network for Anomaly Detection.基于多尺度外观特征与运动信息预测网络的异常检测。
Comput Intell Neurosci. 2021 Oct 20;2021:6789956. doi: 10.1155/2021/6789956. eCollection 2021.
3
Video Anomaly Detection Based on Convolutional Recurrent AutoEncoder.基于卷积递归自动编码器的视频异常检测。
Sensors (Basel). 2022 Jun 20;22(12):4647. doi: 10.3390/s22124647.
4
Robust Unsupervised Video Anomaly Detection by Multipath Frame Prediction.通过多路径帧预测实现强大的无监督视频异常检测
IEEE Trans Neural Netw Learn Syst. 2022 Jun;33(6):2301-2312. doi: 10.1109/TNNLS.2021.3083152. Epub 2022 Jun 1.
5
Multi-Memory Convolutional Neural Network for Video Super-Resolution.用于视频超分辨率的多记忆卷积神经网络。
IEEE Trans Image Process. 2018 Dec 17. doi: 10.1109/TIP.2018.2887017.
6
A Spatial-Temporal Recurrent Neural Network for Video Saliency Prediction.一种用于视频显著性预测的时空循环神经网络。
IEEE Trans Image Process. 2021;30:572-587. doi: 10.1109/TIP.2020.3036749. Epub 2020 Nov 24.
7
DSTAN: A Deformable Spatial-temporal Attention Network with Bidirectional Sequence Feature Refinement for Speckle Noise Removal in Thyroid Ultrasound Video.DSTAN:一种具有双向序列特征细化的可变形时空注意力网络,用于去除甲状腺超声视频中的斑点噪声。
J Imaging Inform Med. 2024 Dec;37(6):3264-3281. doi: 10.1007/s10278-023-00935-5. Epub 2024 Jun 5.
8
Future Frame Prediction Network for Video Anomaly Detection.用于视频异常检测的未来帧预测网络
IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):7505-7520. doi: 10.1109/TPAMI.2021.3129349. Epub 2022 Oct 4.
9
SSIM over MSE: A new perspective for video anomaly detection.
Neural Netw. 2025 May;185:107115. doi: 10.1016/j.neunet.2024.107115. Epub 2025 Jan 14.
10
Weakly Supervised Video Anomaly Detection Based on 3D Convolution and LSTM.基于 3D 卷积和 LSTM 的弱监督视频异常检测。
Sensors (Basel). 2021 Nov 12;21(22):7508. doi: 10.3390/s21227508.

本文引用的文献

1
Deep End-to-End One-Class Classifier.深度端到端单类分类器。
IEEE Trans Neural Netw Learn Syst. 2021 Feb;32(2):675-684. doi: 10.1109/TNNLS.2020.2979049. Epub 2021 Feb 4.
2
Deep learning approach to classification of lung cytological images: Two-step training using actual and synthesized images by progressive growing of generative adversarial networks.深度学习在肺细胞图像分类中的应用:基于生成对抗网络的渐进式增长的两步训练,使用真实和合成图像。
PLoS One. 2020 Mar 5;15(3):e0229951. doi: 10.1371/journal.pone.0229951. eCollection 2020.
3
Temporal Segment Networks for Action Recognition in Videos.
用于视频动作识别的时态片段网络
IEEE Trans Pattern Anal Mach Intell. 2019 Nov;41(11):2740-2755. doi: 10.1109/TPAMI.2018.2868668. Epub 2018 Sep 3.
4
Image quality assessment: from error visibility to structural similarity.图像质量评估:从误差可见性到结构相似性。
IEEE Trans Image Process. 2004 Apr;13(4):600-12. doi: 10.1109/tip.2003.819861.