College of Mathematics and Computer Science, Zhejiang Normal University, Jinhua 321004, China.
Sensors (Basel). 2021 Nov 8;21(21):7422. doi: 10.3390/s21217422.
In terms of small objects in traffic scenes, general object detection algorithms have low detection accuracy, high model complexity, and slow detection speed. To solve the above problems, an improved algorithm (named YOLO-MXANet) is proposed in this paper. Complete-Intersection over Union (CIoU) is utilized to improve loss function for promoting the positioning accuracy of the small object. In order to reduce the complexity of the model, we present a lightweight yet powerful backbone network (named SA-MobileNeXt) that incorporates channel and spatial attention. Our approach can extract expressive features more effectively by applying the Shuffle Channel and Spatial Attention (SCSA) module into the SandGlass Block (SGBlock) module while increasing the parameters by a small number. In addition, the data enhancement method combining Mosaic and Mixup is employed to improve the robustness of the training model. The Multi-scale Feature Enhancement Fusion (MFEF) network is proposed to fuse the extracted features better. In addition, the SiLU activation function is utilized to optimize the Convolution-Batchnorm-Leaky ReLU (CBL) module and the SGBlock module to accelerate the convergence of the model. The ablation experiments on the KITTI dataset show that each improved method is effective. The improved algorithm reduces the complexity and detection speed of the model while improving the object detection accuracy. The comparative experiments on the KITTY dataset and CCTSDB dataset with other algorithms show that our algorithm also has certain advantages.
在交通场景中的小物体方面,一般的目标检测算法检测精度低、模型复杂度高、检测速度慢。为了解决上述问题,本文提出了一种改进的算法(命名为 YOLO-MXANet)。利用完全交并比(CIoU)来改进损失函数,以提高小物体的定位精度。为了降低模型的复杂度,我们提出了一个轻量级但功能强大的骨干网络(命名为 SA-MobileNeXt),该网络结合了通道和空间注意力。我们的方法通过在沙玻璃块(SGBlock)模块中应用 Shuffle Channel 和 Spatial Attention(SCSA)模块,在增加少量参数的情况下,更有效地提取有表现力的特征。此外,还采用了结合 Mosaic 和 Mixup 的数据增强方法来提高训练模型的鲁棒性。提出了多尺度特征增强融合(MFEF)网络来更好地融合提取的特征。此外,利用 SiLU 激活函数来优化卷积-批归一化-泄漏线性单元(CBL)模块和 SGBlock 模块,以加速模型的收敛。在 KITTI 数据集上的消融实验表明,每种改进方法都是有效的。改进后的算法在提高目标检测精度的同时降低了模型的复杂度和检测速度。在 KITTY 数据集和 CCTSDB 数据集上与其他算法的对比实验表明,我们的算法也具有一定的优势。