Veltmeijer Emmeke, Franken Morris, Gerritsen Charlotte
Department of Computer Science, Vrije Universiteit Amsterdam, De Boelelaan 1111, 1081 HV Amsterdam, the Netherlands.
Amsterdam, the Netherlands.
Multimed Tools Appl. 2025;84(7):3793-3807. doi: 10.1007/s11042-024-19144-5. Epub 2024 May 1.
In an era of rapid technological advancements, computer systems play a crucial role in early Violence Detection (VD) and localization, which is critical for timely human intervention. However, existing VD methods often fall short, lacking applicability to surveillance data, and failing to address the localization and social dimension of violent events. To address these shortcomings, we propose a novel approach to integrate subgroups into VD. Our method recognizes and tracks multiple subgroups across frames, providing an additional layer of information in VD. This enables the system to not only detect violence at video-level, but also to identify the groups involved. This adaptable add-on module can enhance the applicability of existing models and algorithms. Through extensive experiments on the SCFD and RWF-2000 surveillance datasets, we find that our approach improves social awareness in real-time VD by localizing the people involved in an act of violence. The system offers a small performance boost on the SCFD dataset and maintains performance on RWF-2000, reaching 91.3% and 87.2% accuracy respectively, demonstrating its practical utility while performing close to state-of-the-art methods. Furthermore, our efficient method generalizes well to unseen datasets, marking a promising advance in early VD.
在技术飞速发展的时代,计算机系统在早期暴力检测(VD)和定位中起着至关重要的作用,这对于及时的人为干预至关重要。然而,现有的VD方法往往存在不足,缺乏对监控数据的适用性,并且未能解决暴力事件的定位和社会层面问题。为了解决这些缺点,我们提出了一种将子群体整合到VD中的新颖方法。我们的方法能够跨帧识别和跟踪多个子群体,在VD中提供额外的信息层。这使得系统不仅能够在视频级别检测暴力行为,还能够识别涉及的群体。这个可适应的附加模块可以提高现有模型和算法的适用性。通过在SCFD和RWF - 2000监控数据集上进行广泛实验,我们发现我们的方法通过定位暴力行为中的相关人员,提高了实时VD中的社会感知能力。该系统在SCFD数据集上性能略有提升,在RWF - 2000数据集上保持性能,准确率分别达到91.3%和87.2%,在性能接近最先进方法的同时展示了其实用性。此外,我们的高效方法能够很好地推广到未见过的数据集,标志着早期VD领域的一项有前途的进展。