• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

监控视频中的弱监督暴力检测。

Weakly Supervised Violence Detection in Surveillance Video.

机构信息

Department of Computer Science, Universidad Católica San Pablo, Arequipa 04001, Peru.

Department of Computer Science, Federal University of Ouro Preto, Ouro Preto 35400-000, Brazil.

出版信息

Sensors (Basel). 2022 Jun 14;22(12):4502. doi: 10.3390/s22124502.

DOI:10.3390/s22124502
PMID:35746286
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9231349/
Abstract

Automatic violence detection in video surveillance is essential for social and personal security. Monitoring the large number of surveillance cameras used in public and private areas is challenging for human operators. The manual nature of this task significantly increases the possibility of ignoring important events due to human limitations when paying attention to multiple targets at a time. Researchers have proposed several methods to detect violent events automatically to overcome this problem. So far, most previous studies have focused only on classifying short clips without performing spatial localization. In this work, we tackle this problem by proposing a weakly supervised method to detect spatially and temporarily violent actions in surveillance videos using only video-level labels. The proposed method follows a Fast-RCNN style architecture, that has been temporally extended. First, we generate spatiotemporal proposals (action tubes) leveraging pre-trained person detectors, motion appearance (dynamic images), and tracking algorithms. Then, given an input video and the action proposals, we extract spatiotemporal features using deep neural networks. Finally, a classifier based on multiple-instance learning is trained to label each action tube as violent or non-violent. We obtain similar results to the state of the art in three public databases Hockey Fight, RLVSD, and RWF-2000, achieving an accuracy of 97.3%, 92.88%, 88.7%, respectively.

摘要

自动的视频监控中的暴力检测对于社会和个人安全至关重要。监控公共和私人区域中使用的大量监控摄像机对人类操作员来说具有挑战性。由于人类在同时关注多个目标时存在注意力的限制,因此这种任务的人工性质大大增加了忽略重要事件的可能性。研究人员已经提出了几种自动检测暴力事件的方法来克服这个问题。到目前为止,大多数先前的研究仅关注于不进行空间定位的短片段分类。在这项工作中,我们通过提出一种仅使用视频级标签来检测监控视频中空间和时间暴力行为的弱监督方法来解决这个问题。所提出的方法遵循 Fast-RCNN 风格的架构,该架构已经在时间上进行了扩展。首先,我们利用预先训练好的人体检测器、运动外观(动态图像)和跟踪算法生成时空建议(动作管)。然后,给定一个输入视频和动作建议,我们使用深度神经网络提取时空特征。最后,基于多实例学习的分类器被训练来标记每个动作管是暴力的还是非暴力的。我们在三个公共数据库 Hockey Fight、RLVSD 和 RWF-2000 上获得了与最先进技术相当的结果,分别达到了 97.3%、92.88%和 88.7%的准确率。

相似文献

1
Weakly Supervised Violence Detection in Surveillance Video.监控视频中的弱监督暴力检测。
Sensors (Basel). 2022 Jun 14;22(12):4502. doi: 10.3390/s22124502.
2
Violence detection in surveillance video using low-level features.基于底层特征的监控视频中的暴力检测。
PLoS One. 2018 Oct 3;13(10):e0203668. doi: 10.1371/journal.pone.0203668. eCollection 2018.
3
Efficient Violence Detection in Surveillance.高效监控中的暴力检测。
Sensors (Basel). 2022 Mar 13;22(6):2216. doi: 10.3390/s22062216.
4
Bus Violence: An Open Benchmark for Video Violence Detection on Public Transport.公交车暴力:公共交通视频暴力检测的公开基准
Sensors (Basel). 2022 Oct 31;22(21):8345. doi: 10.3390/s22218345.
5
Fight Recognition in video using Hough Forests and 2D Convolutional Neural Network.使用 Hough 森林和 2D 卷积神经网络进行视频中的目标识别。
IEEE Trans Image Process. 2018 Oct;27(10):4787-4797. doi: 10.1109/TIP.2018.2845742. Epub 2018 Jun 8.
6
Integrating Spatial and Temporal Information for Violent Activity Detection from Video Using Deep Spiking Neural Networks.利用深度尖峰神经网络从视频中整合时空信息进行暴力活动检测。
Sensors (Basel). 2023 May 6;23(9):4532. doi: 10.3390/s23094532.
7
Deep Graph Metric Learning for Weakly Supervised Person Re-Identification.深度图度量学习在弱监督行人再识别中的应用。
IEEE Trans Pattern Anal Mach Intell. 2022 Oct;44(10):6074-6093. doi: 10.1109/TPAMI.2021.3084613. Epub 2022 Sep 14.
8
Real-time multiple spatiotemporal action localization and prediction approach using deep learning.基于深度学习的实时多时空动作定位与预测方法。
Neural Netw. 2020 Aug;128:331-344. doi: 10.1016/j.neunet.2020.05.017. Epub 2020 May 19.
9
Anomaly Detection in Traffic Surveillance Videos Using Deep Learning.基于深度学习的交通监控视频异常检测。
Sensors (Basel). 2022 Aug 31;22(17):6563. doi: 10.3390/s22176563.
10
A dataset for automatic violence detection in videos.一个用于视频中暴力行为自动检测的数据集。
Data Brief. 2020 Nov 26;33:106587. doi: 10.1016/j.dib.2020.106587. eCollection 2020 Dec.

本文引用的文献

1
A Survey of the Techniques for The Identification and Classification of Human Actions from Visual Data.基于视觉数据的人类动作识别与分类技术综述。
Sensors (Basel). 2018 Nov 15;18(11):3979. doi: 10.3390/s18113979.
2
Violence detection in surveillance video using low-level features.基于底层特征的监控视频中的暴力检测。
PLoS One. 2018 Oct 3;13(10):e0203668. doi: 10.1371/journal.pone.0203668. eCollection 2018.
3
Fight Recognition in video using Hough Forests and 2D Convolutional Neural Network.使用 Hough 森林和 2D 卷积神经网络进行视频中的目标识别。
IEEE Trans Image Process. 2018 Oct;27(10):4787-4797. doi: 10.1109/TIP.2018.2845742. Epub 2018 Jun 8.
4
Action Recognition with Dynamic Image Networks.基于动态图像网络的动作识别
IEEE Trans Pattern Anal Mach Intell. 2018 Dec;40(12):2799-2813. doi: 10.1109/TPAMI.2017.2769085. Epub 2017 Nov 2.