Pu Yujiang, Wu Xiaoyu, Yang Lulu, Wang Shengjin
IEEE Trans Image Process. 2024;33:4923-4936. doi: 10.1109/TIP.2024.3451935. Epub 2024 Sep 11.
Weakly supervised video anomaly detection aims to locate abnormal activities in untrimmed videos without the need for frame-level supervision. Prior work has utilized graph convolution networks or self-attention mechanisms alongside multiple instance learning (MIL)-based classification loss to model temporal relations and learn discriminative features. However, these approaches are limited in two aspects: 1) Multi-branch parallel architectures, while capturing multi-scale temporal dependencies, inevitably lead to increased parameter and computational costs. 2) The binarized MIL constraint only ensures inter-class separability while neglecting the fine-grained discriminability within anomalous classes. To this end, we introduce a novel WS-VAD framework that focuses on efficient temporal modeling and intra-class discriminability among anomalies. We first construct a Temporal Context Aggregation (TCA) module that simultaneously captures local-global dependencies by reusing an attention matrix along with adaptive context fusion. In addition, we propose a Prompt-Enhanced Learning (PEL) module that incorporates semantic priors using knowledge-based prompts to boost the discrimination of visual features while ensuring separability across anomaly subclasses. The proposed components have been validated through extensive experiments, which demonstrate superior performance on three challenging datasets, UCF-Crime, XD-Violence and ShanghaiTech, with fewer parameters and reduced computational effort. Notably, our method can significantly improve the detection accuracy for certain anomaly subclasses and reduce the false alarm rate. Our code is available at: https://github.com/yujiangpu20/PEL4VAD.
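The abstract's key efficiency claim is that one attention matrix can serve both a local and a global temporal branch. The minimal sketch below illustrates that idea only; it is not the paper's actual TCA module. The function name `tca_sketch`, the banded mask `window`, and the fixed fusion weight `alpha` are illustrative assumptions (the paper describes adaptive context fusion, and its exact formulation is in the full text and the linked repository).

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax over the given axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def tca_sketch(feats, window=3, alpha=0.5):
    """Illustrative sketch of attention-matrix reuse (NOT the paper's TCA).

    feats:  (T, D) array of per-snippet video features.
    window: half-width of the banded mask for the local branch (assumed).
    alpha:  fixed fusion weight; the paper learns this adaptively (assumed).
    """
    T, D = feats.shape
    sim = feats @ feats.T / np.sqrt(D)              # one T x T similarity matrix
    global_attn = softmax(sim, axis=-1)             # global branch uses it directly
    idx = np.arange(T)
    local_mask = np.abs(idx[:, None] - idx[None, :]) <= window
    local_sim = np.where(local_mask, sim, -np.inf)  # local branch reuses `sim`,
    local_attn = softmax(local_sim, axis=-1)        # masked to a temporal band
    # Fuse local and global context (fixed weight here, adaptive in the paper).
    return alpha * (local_attn @ feats) + (1 - alpha) * (global_attn @ feats)
```

Because both branches share the single similarity matrix `sim`, the local branch adds only a masking and re-normalization step rather than a second attention computation, which is consistent with the abstract's claim of fewer parameters and reduced computational effort compared to multi-branch parallel architectures.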