Zhou Yan
Ocean College, Zhejiang University, Zhoushan, 316021, China.
State Key Laboratory of Satellite Ocean Environment Dynamics, Second Institute of Oceanography, Ministry of Natural Resources, Hangzhou, 310012, China.
Heliyon. 2023 Aug 9;9(8):e19064. doi: 10.1016/j.heliyon.2023.e19064. eCollection 2023 Aug.
Object detection is a fundamental task in computer vision that aims to locate and classify objects in images or videos. The one-stage You Only Look Once (YOLO) models are popular approaches to object detection. Real-time monitoring of mask wearing is necessary, especially for preventing the spread of the COVID-19 virus. While YOLO detectors facing challenges include improving the robustness of object detectors against occlusion, scale variation, handling false detection and false negative, and maintaining the balance between higher precision detection and faster inference time. In this study, a novel object detection model called Improved You Only Look Once and None Left (IYOLO-NL) based on YOLOv5 was proposed for real-time mask wearing detection. To fulfill the requirement of real-time detection, the lightweight IYOLO-NL was developed by using novel CSPNet-Ghost and SSPP bottleneck architecture. To prevent any missed correct results, IYOLO-NL integrates the proposed PANet-SC with a multi-level prediction scheme. To achieve high precision and handle sample allocation properly, the proposed global dynamic-k label assignment strategy was utilized in an anchor-free manner. A large dataset of face masks (FMD) was created, consisting of 6130 images, for use in conducting experiments on IYOLO-NL and other models. The experiment results show that IYOLO-NL surpasses other state-of-the-art (SOTA) methods and achieves 98.8% accuracy while maintaining 130 FPS.
目标检测是计算机视觉中的一项基本任务,旨在定位和分类图像或视频中的物体。单阶段的“你只看一次”(YOLO)模型是目标检测的流行方法。对佩戴口罩进行实时监测很有必要,特别是对于预防新冠病毒的传播。虽然YOLO检测器面临的挑战包括提高目标检测器对遮挡、尺度变化的鲁棒性,处理误检和漏检,并在更高精度检测和更快推理时间之间保持平衡。在本研究中,提出了一种基于YOLOv5的名为改进的“你只看一次且无一遗漏”(IYOLO-NL)的新型目标检测模型,用于实时口罩佩戴检测。为了满足实时检测的要求,通过使用新颖的CSPNet-Ghost和SSPP瓶颈架构开发了轻量级的IYOLO-NL。为了防止任何正确结果被遗漏,IYOLO-NL将所提出的PANet-SC与多级预测方案相结合。为了实现高精度并正确处理样本分配,以无锚点的方式采用了所提出的全局动态k标签分配策略。创建了一个由6130张图像组成的大型口罩数据集(FMD),用于对IYOLO-NL和其他模型进行实验。实验结果表明,IYOLO-NL超越了其他先进(SOTA)方法,在保持130帧每秒的同时,准确率达到了98.8%。