Zhao Liang, Fu Lulu, Jia Xin, Cui Beibei, Zhu Xianchao, Jin Junwei
College of Electrical Engineering, Henan University of Technology, Zhengzhou 450001, China.
School of Artificial Intelligence and Big Data, Henan University of Technology, Zhengzhou 450001, China.
Sensors (Basel). 2024 Dec 19;24(24):8126. doi: 10.3390/s24248126.
In intelligent transportation systems, accurate vehicle target recognition within road scenarios is crucial for achieving intelligent traffic management. Addressing the challenges posed by complex environments and severe vehicle occlusion in such scenarios, this paper proposes a novel vehicle-detection method, YOLO-BOS. First, to bolster the feature-extraction capabilities of the backbone network, we propose a novel Bi-level Routing Spatial Attention (BRSA) mechanism, which selectively filters features based on task requirements and adjusts the importance of spatial locations to more accurately enhance relevant features. Second, we incorporate Omni-directional Dynamic Convolution (ODConv) into the head network, which is capable of simultaneously learning complementary attention across the four dimensions of the kernel space, therefore facilitating the capture of multifaceted features from the input data. Lastly, we introduce Shape-IOU, a new loss function that significantly enhances the accuracy and robustness of detection results for vehicles of varying sizes. Experimental evaluations conducted on the UA-DETRAC dataset demonstrate that our model achieves improvements of 4.7 and 4.4 percentage points in mAP@0.5 and mAP@0.5:0.95, respectively, compared to the baseline model. Furthermore, comparative experiments on the SODA10M dataset corroborate the superiority of our method in terms of precision and accuracy.
在智能交通系统中,道路场景内准确的车辆目标识别对于实现智能交通管理至关重要。针对此类场景中复杂环境和严重车辆遮挡带来的挑战,本文提出了一种新颖的车辆检测方法YOLO-BOS。首先,为增强骨干网络的特征提取能力,我们提出了一种新颖的双层路由空间注意力(BRSA)机制,该机制根据任务需求选择性地过滤特征,并调整空间位置的重要性,以更准确地增强相关特征。其次,我们将全向动态卷积(ODConv)纳入头部网络,它能够同时在核空间的四个维度上学习互补注意力,从而便于从输入数据中捕获多方面特征。最后,我们引入了Shape-IOU,一种新的损失函数,它显著提高了不同尺寸车辆检测结果的准确性和鲁棒性。在UA-DETRAC数据集上进行的实验评估表明,与基线模型相比,我们的模型在mAP@0.5和mAP@0.5:0.95上分别提高了4.7和4.4个百分点。此外,在SODA10M数据集上的对比实验证实了我们方法在精度和准确性方面的优越性。