P Shanthi, V Manjula
School of Computer Science and Engineering, Vellore Institute of Technology, Chennai, 600 127, India.
Sci Rep. 2025 Jul 23;15(1):26766. doi: 10.1038/s41598-025-07782-0.
In modern days, increasing weapon-related threats in public places have created an immediate need for intelligent surveillance systems to detect crime in real-time. Traditional surveillance systems have struggles with recognizing small objects, occlusion, and the time it takes to respond, which makes them ineffective in crowded and fast-changing situations. To overcome these challenges, the suggested system combines closed-circuit television (CCTV) surveillance cameras with advanced deep learning methods, image processing, and computer vision techniques for real-time crime prediction and prevention. This study proposes a hybrid deep learning framework that merges a Faster region convolutional neural network and Mask Region Convolutional Neural Network, named FMR-CNN. The novel approach FMR-CNN represents a significant advancement towards improving object recognition and segmentation of images and videos. It has been combined with YOLOv8 to increase the real-time detection speed and localization accuracy significantly. Such a combination enables the concurrent utilization of high-resolution spatial context information and rapid frame-wise predictions, thus making it well-suited for continuous video surveillance tasks. The model was trained and tested on a five labeled class annotated dataset, where MobileNetV3 features are extracted to simulate real-world surveillance conditions. Experimental results show the hybrid model attains detection accuracy of 98.7%, average precision (AP) of 90.1, and speed of 9.2 frames per second (FPS), and generalizes to varied lighting, occlusion, object scales, and reduced computational complexity, making it highly effective for crime prevention. Using these models benefits police departments and law enforcement agencies, as it allows them to detect criminal offenses earlier and avoid untoward situations.
在现代,公共场所与武器相关的威胁日益增加,这使得迫切需要智能监控系统来实时检测犯罪行为。传统监控系统在识别小物体、遮挡情况以及响应时间方面存在困难,这使得它们在拥挤且快速变化的场景中效果不佳。为了克服这些挑战,所建议的系统将闭路电视(CCTV)监控摄像头与先进的深度学习方法、图像处理和计算机视觉技术相结合,以进行实时犯罪预测和预防。本研究提出了一种混合深度学习框架,该框架融合了更快区域卷积神经网络和掩码区域卷积神经网络,名为FMR-CNN。新颖的FMR-CNN方法在提高图像和视频的目标识别与分割方面取得了显著进展。它已与YOLOv8相结合,以显著提高实时检测速度和定位精度。这种结合能够同时利用高分辨率空间上下文信息和快速逐帧预测,因此非常适合连续视频监控任务。该模型在一个带有五个标注类别的数据集上进行训练和测试,在该数据集中提取MobileNetV3特征以模拟现实世界的监控条件。实验结果表明,该混合模型的检测准确率达到98.7%,平均精度(AP)为90.1,速度为每秒9.2帧(FPS),并且能够推广到不同的光照、遮挡、目标尺度情况,同时降低了计算复杂度,使其在预防犯罪方面非常有效。使用这些模型对警察部门和执法机构有益,因为这使他们能够更早地检测到刑事犯罪并避免不良情况的发生。