Faculty of Vehicle Engineering and Mechanics, School of Automotive Engineering, Dalian University of Technology, Dalian 116024, China.
Sensors (Basel). 2021 Apr 26;21(9):3031. doi: 10.3390/s21093031.
There are many small objects in traffic scenes, but due to their low resolution and limited information, their detection is still a challenge. Small object detection is very important for the understanding of traffic scene environments. To improve the detection accuracy of small objects in traffic scenes, we propose a small object detection method in traffic scenes based on attention feature fusion. First, a multi-scale channel attention block (MS-CAB) is designed, which uses local and global scales to aggregate the effective information of the feature maps. Based on this block, an attention feature fusion block (AFFB) is proposed, which can better integrate contextual information from different layers. Finally, the AFFB is used to replace the linear fusion module in the object detection network and obtain the final network structure. The experimental results show that, compared to the benchmark model YOLOv5s, this method has achieved a higher mean Average Precison (mAP) under the premise of ensuring real-time performance. It increases the mAP of all objects by 0.9 percentage points on the validation set of the traffic scene dataset BDD100K, and at the same time, increases the mAP of small objects by 3.5%.
交通场景中有许多小物体,但由于它们的分辨率低且信息量有限,因此它们的检测仍然是一个挑战。小物体检测对于理解交通场景环境非常重要。为了提高交通场景中小物体的检测精度,我们提出了一种基于注意力特征融合的交通场景小物体检测方法。首先,设计了一个多尺度通道注意力块(MS-CAB),它使用局部和全局尺度来聚合特征图的有效信息。基于这个块,提出了一个注意力特征融合块(AFFB),它可以更好地融合来自不同层的上下文信息。最后,使用 AFFB 替换目标检测网络中的线性融合模块,得到最终的网络结构。实验结果表明,与基准模型 YOLOv5s 相比,在保证实时性能的前提下,该方法在交通场景数据集 BDD100K 的验证集上实现了更高的平均精度(mAP)。它将所有物体的 mAP 提高了 0.9 个百分点,同时将小物体的 mAP 提高了 3.5%。