Song Qi, Yao Bodan, Xue Yunlong, Ji Shude
School of Automation, Shenyang Aerospace University, Shenyang 110136, China.
State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Beijing 100045, China.
Sensors (Basel). 2024 Oct 30;24(21):6955. doi: 10.3390/s24216955.
A novel detection model, MS-YOLO, is developed in this paper to improve the efficiency of drowning rescue operations. The model is lightweight, high in precision, and applicable for intelligent hardware platforms. Firstly, the MD-C2F structure is built to capture the subtle movements and posture changes in various aquatic environments, with a light weight achieved by introducing dynamic convolution (DcConv). To make the model perform better in small object detection, the EMA mechanism is incorporated into the MD-C2F. Secondly, the MSI-SPPF module is constructed to improve the performance in identifying the features of different scales and the understanding of complex backgrounds. Finally, the ConCat single-channel fusion is replaced by BiFPN weighted channel fusion to retain more feature information and remove the irrelevant information in drowning features. Relative to the Faster R-CNN, SSD, YOLOv6, YOLOv9, and YOLOv10, the MS-YOLO achieves an average accuracy of 86.4% in detection on a self-built dataset at an ultra-low computational cost of 7.3 GFLOPs.
本文开发了一种新型检测模型MS-YOLO,以提高溺水救援行动的效率。该模型轻量级、精度高,适用于智能硬件平台。首先,构建MD-C2F结构以捕捉各种水生环境中的细微动作和姿势变化,通过引入动态卷积(DcConv)实现轻量级。为使模型在小目标检测中表现更好,将EMA机制纳入MD-C2F。其次,构建MSI-SPPF模块以提高在识别不同尺度特征和理解复杂背景方面的性能。最后,将ConCat单通道融合替换为BiFPN加权通道融合,以保留更多特征信息并去除溺水特征中的无关信息。相对于Faster R-CNN、SSD、YOLOv6、YOLOv9和YOLOv10,MS-YOLO在自建数据集上进行检测时,以7.3 GFLOPs的超低计算成本实现了86.4%的平均准确率。