Suppr超能文献

基于红外热成像技术和改进的YOLOv5s模型的黑暗水生环境中的人员检测

Personnel Detection in Dark Aquatic Environments Based on Infrared Thermal Imaging Technology and an Improved YOLOv5s Model.

作者信息

Cheng Liang, He Yunze, Mao Yankai, Liu Zhenkang, Dang Xiangzhao, Dong Yilong, Wu Liangliang

机构信息

School of Ocean Engineering, Jiangsu Ocean University, Lianyungang 222005, China.

Zhuhai Yunzhou Intelligence Technology Co., Ltd., Zhuhai 519085, China.

出版信息

Sensors (Basel). 2024 May 23;24(11):3321. doi: 10.3390/s24113321.

Abstract

This study presents a novel method for the nighttime detection of waterborne individuals using an enhanced YOLOv5s algorithm tailored for infrared thermal imaging. To address the unique challenges of nighttime water rescue operations, we have constructed a specialized dataset comprising 5736 thermal images collected from diverse aquatic environments. This dataset was further expanded through synthetic image generation using CycleGAN and a newly developed color gamut transformation technique, which significantly improves the data variance and model training effectiveness. Furthermore, we integrated the Convolutional Block Attention Module (CBAM) at the end of the last encoder's feedforward network. This integration maximizes the utilization of channel and spatial information to capture more intricate details in the feature maps. To decrease the computational demands of the network while maintaining model accuracy, Ghost convolution was employed, thereby boosting the inference speed as much as possible. Additionally, we applied hyperparameter evolution to refine the training parameters. The improved algorithm achieved an average detection accuracy of 85.49% on our proprietary dataset, significantly outperforming its predecessor, with a prediction speed of 23.51 FPS. The experimental outcomes demonstrate the proposed solution's high recognition capabilities and robustness, fulfilling the demands of intelligent lifesaving missions.

摘要

本研究提出了一种新颖的方法,用于使用针对红外热成像量身定制的增强型YOLOv5s算法在夜间检测水中的人员。为应对夜间水上救援行动的独特挑战,我们构建了一个专门的数据集,其中包含从不同水生环境收集的5736张热图像。通过使用CycleGAN和新开发的色域变换技术生成合成图像,进一步扩展了该数据集,这显著提高了数据方差和模型训练效果。此外,我们在最后一个编码器的前馈网络末尾集成了卷积块注意力模块(CBAM)。这种集成最大限度地利用了通道和空间信息,以捕获特征图中更复杂的细节。为了在保持模型准确性的同时降低网络的计算需求,采用了Ghost卷积,从而尽可能提高推理速度。此外,我们应用超参数进化来优化训练参数。改进后的算法在我们的专有数据集上实现了85.49%的平均检测准确率,显著优于其前身,预测速度为23.51 FPS。实验结果证明了所提出的解决方案具有很高的识别能力和鲁棒性,满足了智能救生任务的需求。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55fc/11175020/9d08649e1b34/sensors-24-03321-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验