基于GR-YOLO的密集行人检测

Dense Pedestrian Detection Based on GR-YOLO.

作者信息

Li Nianfeng, Bai Xinlu, Shen Xiangfeng, Xin Peizeng, Tian Jia, Chai Tengfei, Wang Zhenyan

机构信息

College of Computer Science and Technology, Changchun University, No. 6543, Satellite Road, Changchun 130022, China.

出版信息

Sensors (Basel). 2024 Jul 22;24(14):4747. doi: 10.3390/s24144747.

DOI:10.3390/s24144747

PMID:39066144

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11281216/

Abstract

In large public places such as railway stations and airports, dense pedestrian detection is important for safety and security. Deep learning methods provide relatively effective solutions but still face problems such as feature extraction difficulties, image multi-scale variations, and high leakage detection rates, which bring great challenges to the research in this field. In this paper, we propose an improved dense pedestrian detection algorithm GR-yolo based on Yolov8. GR-yolo introduces the repc3 module to optimize the backbone network, which enhances the ability of feature extraction, adopts the aggregation-distribution mechanism to reconstruct the yolov8 neck structure, fuses multi-level information, achieves a more efficient exchange of information, and enhances the detection ability of the model. Meanwhile, the Giou loss calculation is used to help GR-yolo converge better, improve the detection accuracy of the target position, and reduce missed detection. Experiments show that GR-yolo has improved detection performance over yolov8, with a 3.1% improvement in detection means accuracy on the wider people dataset, 7.2% on the crowd human dataset, and 11.7% on the people detection images dataset. Therefore, the proposed GR-yolo algorithm is suitable for dense, multi-scale, and scene-variable pedestrian detection, and the improvement also provides a new idea to solve dense pedestrian detection in real scenes.

摘要

在火车站和机场等大型公共场所，密集行人检测对于安全保障至关重要。深度学习方法提供了相对有效的解决方案，但仍面临特征提取困难、图像多尺度变化以及高漏检率等问题，这给该领域的研究带来了巨大挑战。在本文中，我们提出了一种基于Yolov8的改进型密集行人检测算法GR-yolo。GR-yolo引入repc3模块来优化主干网络，增强了特征提取能力，采用聚合-分布机制重构Yolov8的颈部结构，融合多级别信息，实现了更高效的信息交换，提升了模型的检测能力。同时，使用Giou损失计算来帮助GR-yolo更好地收敛，提高目标位置的检测精度，减少漏检。实验表明，GR-yolo在检测性能上优于Yolov8，在更广泛的行人数据集上检测平均准确率提高了3.1%，在人群行人数据集上提高了7.2%，在行人检测图像数据集上提高了11.7%。因此，所提出的GR-yolo算法适用于密集、多尺度和场景可变的行人检测，该改进也为解决实际场景中的密集行人检测提供了新思路。