Huang Xianning, Zhang Yaping
School of Information Science and Technology, Yunnan Normal University, Kunming 650500, China.
Sensors (Basel). 2023 Dec 24;24(1):102. doi: 10.3390/s24010102.
To address the problem of low recall rate in the detection of prohibited items in X-ray images due to the severe object occlusion and complex background, an X-ray prohibited item detection network, ScanGuard-YOLO, based on the YOLOv5 architecture, is proposed to effectively improve the model's recall rate and the comprehensive metric F1 score. Firstly, the RFB-s module was added to the end part of the backbone, and dilated convolution was used to increase the receptive field of the backbone network to better capture global features. In the neck section, the efficient RepGFPN module was employed to fuse multiscale information from the backbone output. This aimed to capture details and contextual information at various scales, thereby enhancing the model's understanding and representation capability of the object. Secondly, a novel detection head was introduced to unify scale-awareness, spatial-awareness, and task-awareness altogether, which significantly improved the representation ability of the object detection heads. Finally, the bounding box regression loss function was defined as the WIOUv3 loss, effectively balancing the contribution of low-quality and high-quality samples to the loss. ScanGuard-YOLO was tested on OPIXray and HiXray datasets, showing significant improvements compared to the baseline model. The mean average precision (mAP@0.5) increased by 2.3% and 1.6%, the recall rate improved by 4.5% and 2%, and the F1 score increased by 2.3% and 1%, respectively. The experimental results demonstrate that ScanGuard-YOLO effectively enhances the detection capability of prohibited items in complex backgrounds and exhibits broad prospects for application.
为了解决由于严重的物体遮挡和复杂背景导致的X射线图像中违禁物品检测召回率低的问题,提出了一种基于YOLOv5架构的X射线违禁物品检测网络ScanGuard-YOLO,以有效提高模型的召回率和综合指标F1分数。首先,在主干网络的末端添加了RFB-s模块,并使用空洞卷积来增加主干网络的感受野,以便更好地捕捉全局特征。在颈部部分,采用了高效的RepGFPN模块来融合主干网络输出的多尺度信息。这旨在捕捉不同尺度的细节和上下文信息,从而增强模型对物体的理解和表示能力。其次,引入了一种新颖的检测头,将尺度感知、空间感知和任务感知统一起来,显著提高了目标检测头的表示能力。最后,将边界框回归损失函数定义为WIOUv3损失,有效平衡了低质量和高质量样本对损失的贡献。ScanGuard-YOLO在OPIXray和HiXray数据集上进行了测试,与基线模型相比有显著改进。平均精度均值(mAP@0.5)分别提高了2.3%和1.6%,召回率提高了4.5%和2%,F1分数分别提高了2.3%和1%。实验结果表明,ScanGuard-YOLO有效地增强了复杂背景下违禁物品的检测能力,具有广阔的应用前景。