School of Automotive and Traffic Engineering, Nanjing Forestry University, Nanjing 210037, China.
School of Energy and Power Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China.
Sensors (Basel). 2022 Nov 4;22(21):8480. doi: 10.3390/s22218480.
Object detection is a critical technology of environmental perception for autonomous driving vehicle. The Convolutional Neural Network has gradually become a powerful tool in the field of vehicle detection because of its powerful ability of feature extraction. In aiming to reach the balance between speed and accuracy of detection in complex traffic scenarios, this paper proposes an improved lightweight and high-performance vehicle-pedestrian detection algorithm based on the YOLOv4. Firstly, the backbone network CSPDarknet53 is replaced by MobileNetv2 to reduce the number of parameters and raise the capability of feature extraction. Secondly, the method of multi-scale feature fusion is used to realize the information interaction among different feature layers. Finally, a coordinate attention mechanism is added to focus on the region of interest in the image by way of weight adjustment. The experimental results show that this improved model has a great performance in vehicle-pedestrian detection in traffic scenarios. Experimental results on PASCAL VOC datasets show that the improved model's is 85.79% and speed is 35FPS, which has an increase of 4.31% and 16.7% compared to YOLOv4. Furthermore, the improved YOLOv4 model maintains a great balance between detection accuracy and speed on different datasets, indicating that it can be applied to vehicle-pedestrian detection in traffic scenarios.
目标检测是自动驾驶车辆环境感知的关键技术。卷积神经网络因其强大的特征提取能力,逐渐成为车辆检测领域的有力工具。本文针对复杂交通场景下检测的速度与精度的平衡问题,提出了一种基于 YOLOv4 的改进轻量级、高性能的车辆-行人检测算法。首先,用 MobileNetv2 替换骨干网络 CSPDarknet53,减少参数数量,提高特征提取能力。其次,采用多尺度特征融合方法实现不同特征层之间的信息交互。最后,添加坐标注意力机制,通过权重调整,关注图像中的感兴趣区域。实验结果表明,该改进模型在交通场景下的车辆-行人检测中具有很好的性能。在 PASCAL VOC 数据集上的实验结果表明,改进后的模型的精度为 85.79%,速度为 35FPS,与 YOLOv4 相比分别提高了 4.31%和 16.7%。此外,改进后的 YOLOv4 模型在不同数据集上的检测精度和速度之间保持了很好的平衡,表明它可以应用于交通场景中的车辆-行人检测。