• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于卷积神经网络-视觉Transformer的弱监督视频片段级异常检测

CNN-ViT Supported Weakly-Supervised Video Segment Level Anomaly Detection.

作者信息

Sharif Md Haidar, Jiao Lei, Omlin Christian W

机构信息

Department of ICT, University of Agder, 4630 Kristiansand, Norway.

出版信息

Sensors (Basel). 2023 Sep 7;23(18):7734. doi: 10.3390/s23187734.

DOI:10.3390/s23187734
PMID:37765792
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10537718/
Abstract

Video anomaly event detection (VAED) is one of the key technologies in computer vision for smart surveillance systems. With the advent of deep learning, contemporary advances in VAED have achieved substantial success. Recently, weakly supervised VAED (WVAED) has become a popular VAED technical route of research. WVAED methods do not depend on a supplementary self-supervised substitute task, yet they can assess anomaly scores straightway. However, the performance of WVAED methods depends on pretrained feature extractors. In this paper, we first address taking advantage of two pretrained feature extractors for CNN (e.g., C3D and I3D) and ViT (e.g., CLIP), for effectively extracting discerning representations. We then consider long-range and short-range temporal dependencies and put forward video snippets of interest by leveraging our proposed temporal self-attention network (TSAN). We design a multiple instance learning (MIL)-based generalized architecture named CNN-ViT-TSAN, by using CNN- and/or ViT-extracted features and TSAN to specify a series of models for the WVAED problem. Experimental results on publicly available popular crowd datasets demonstrated the effectiveness of our CNN-ViT-TSAN.

摘要

视频异常事件检测(VAED)是智能监控系统计算机视觉中的关键技术之一。随着深度学习的出现,VAED的当代进展取得了巨大成功。最近,弱监督VAED(WVAED)已成为一种流行的VAED技术研究路线。WVAED方法不依赖于补充的自监督替代任务,但它们可以直接评估异常分数。然而,WVAED方法的性能取决于预训练的特征提取器。在本文中,我们首先利用两个用于卷积神经网络(CNN)(例如C3D和I3D)和视觉Transformer(ViT)(例如CLIP)的预训练特征提取器,以有效提取有辨别力的表示。然后,我们考虑长程和短程时间依赖性,并通过利用我们提出的时间自注意力网络(TSAN)提出感兴趣的视频片段。我们设计了一种基于多实例学习(MIL)的广义架构,名为CNN-ViT-TSAN,通过使用CNN和/或ViT提取的特征以及TSAN为WVAED问题指定一系列模型。在公开可用的流行人群数据集上的实验结果证明了我们的CNN-ViT-TSAN的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5abb/10537718/3344fa89c84e/sensors-23-07734-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5abb/10537718/9d58e718c81e/sensors-23-07734-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5abb/10537718/bcff1c2a3910/sensors-23-07734-g002a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5abb/10537718/2771b5776f93/sensors-23-07734-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5abb/10537718/700fd6c0528a/sensors-23-07734-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5abb/10537718/3344fa89c84e/sensors-23-07734-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5abb/10537718/9d58e718c81e/sensors-23-07734-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5abb/10537718/bcff1c2a3910/sensors-23-07734-g002a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5abb/10537718/2771b5776f93/sensors-23-07734-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5abb/10537718/700fd6c0528a/sensors-23-07734-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5abb/10537718/3344fa89c84e/sensors-23-07734-g005.jpg

相似文献

1
CNN-ViT Supported Weakly-Supervised Video Segment Level Anomaly Detection.基于卷积神经网络-视觉Transformer的弱监督视频片段级异常检测
Sensors (Basel). 2023 Sep 7;23(18):7734. doi: 10.3390/s23187734.
2
Multimodal and multiscale feature fusion for weakly supervised video anomaly detection.用于弱监督视频异常检测的多模态和多尺度特征融合
Sci Rep. 2024 Oct 1;14(1):22835. doi: 10.1038/s41598-024-73462-0.
3
Weakly Supervised Video Anomaly Detection via Self-Guided Temporal Discriminative Transformer.基于自引导时间判别变压器的弱监督视频异常检测
IEEE Trans Cybern. 2024 May;54(5):3197-3210. doi: 10.1109/TCYB.2022.3227044. Epub 2024 Apr 16.
4
SERE: Exploring Feature Self-Relation for Self-Supervised Transformer.SERE:探索用于自监督Transformer的特征自关系
IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):15619-15631. doi: 10.1109/TPAMI.2023.3309979. Epub 2023 Nov 3.
5
Seeking an optimal approach for Computer-aided Diagnosis of Pulmonary Embolism.寻求肺栓塞计算机辅助诊断的最佳方法。
Med Image Anal. 2024 Jan;91:102988. doi: 10.1016/j.media.2023.102988. Epub 2023 Oct 13.
6
Semi-Supervised Anomaly Detection in Video-Surveillance Scenes in the Wild.半监督异常检测在野外视频监控场景中的应用。
Sensors (Basel). 2021 Jun 9;21(12):3993. doi: 10.3390/s21123993.
7
Modeling long-range dependencies for weakly supervised disease classification and localization on chest X-ray.胸部X光片上弱监督疾病分类与定位的长程依赖建模
Quant Imaging Med Surg. 2022 Jun;12(6):3364-3378. doi: 10.21037/qims-21-1117.
8
A VGG attention vision transformer network for benign and malignant classification of breast ultrasound images.基于 VGG 注意力机制的视觉Transformer 网络在乳腺超声图像良恶性分类中的应用。
Med Phys. 2022 Sep;49(9):5787-5798. doi: 10.1002/mp.15852. Epub 2022 Jul 30.
9
Semi-supervised vision transformer with adaptive token sampling for breast cancer classification.用于乳腺癌分类的具有自适应令牌采样的半监督视觉Transformer
Front Pharmacol. 2022 Jul 22;13:929755. doi: 10.3389/fphar.2022.929755. eCollection 2022.
10
Transformer-based unsupervised contrastive learning for histopathological image classification.基于 Transformer 的无监督对比学习在组织病理学图像分类中的应用。
Med Image Anal. 2022 Oct;81:102559. doi: 10.1016/j.media.2022.102559. Epub 2022 Jul 30.

本文引用的文献

1
Learning Prompt-Enhanced Context Features for Weakly-Supervised Video Anomaly Detection.学习用于弱监督视频异常检测的提示增强上下文特征。
IEEE Trans Image Process. 2024;33:4923-4936. doi: 10.1109/TIP.2024.3451935. Epub 2024 Sep 11.
2
Clustering Aided Weakly Supervised Training to Detect Anomalous Events in Surveillance Videos.聚类辅助的弱监督训练用于检测监控视频中的异常事件。
IEEE Trans Neural Netw Learn Syst. 2024 Oct;35(10):14085-14098. doi: 10.1109/TNNLS.2023.3274611. Epub 2024 Oct 7.
3
CNN-LSTM vs. LSTM-CNN to Predict Power Flow Direction: A Case Study of the High-Voltage Subnet of Northeast Germany.
CNN-LSTM 与 LSTM-CNN 预测潮流方向:以德国东北部高压子网为例。
Sensors (Basel). 2023 Jan 12;23(2):901. doi: 10.3390/s23020901.
4
Localizing Anomalies From Weakly-Labeled Videos.从弱标注视频中定位异常
IEEE Trans Image Process. 2021;30:4505-4515. doi: 10.1109/TIP.2021.3072863. Epub 2021 Apr 28.
5
Learning Causal Temporal Relation and Feature Discrimination for Anomaly Detection.用于异常检测的因果时间关系学习与特征辨别
IEEE Trans Image Process. 2021;30:3513-3527. doi: 10.1109/TIP.2021.3062192. Epub 2021 Mar 11.
6
Temporal Segment Networks for Action Recognition in Videos.用于视频动作识别的时态片段网络
IEEE Trans Pattern Anal Mach Intell. 2019 Nov;41(11):2740-2755. doi: 10.1109/TPAMI.2018.2868668. Epub 2018 Sep 3.
7
3D convolutional neural networks for human action recognition.三维卷积神经网络的人体动作识别。
IEEE Trans Pattern Anal Mach Intell. 2013 Jan;35(1):221-31. doi: 10.1109/TPAMI.2012.59.