Suppr超能文献

ARSOD-YOLO:增强遥感图像的小目标检测

ARSOD-YOLO: Enhancing Small Target Detection for Remote Sensing Images.

作者信息

Qiu Yijuan, Zheng Xiangyue, Hao Xuying, Zhang Gang, Lei Tao, Jiang Ping

机构信息

National Laboratory on Adaptive Optics, Chengdu 610209, China.

University of Chinese Academy of Sciences, Beijing 101408, China.

出版信息

Sensors (Basel). 2024 Nov 23;24(23):7472. doi: 10.3390/s24237472.

Abstract

Remote sensing images play a vital role in domains including environmental monitoring, agriculture, and autonomous driving. However, the detection of targets in remote sensing images remains a challenging task. This study introduces innovative methods to enhance feature extraction, feature fusion, and model optimization. The Adaptive Selective Feature Enhancement Module (AFEM) dynamically adjusts feature weights using GhostModule and sigmoid functions, thereby enhancing the accuracy of small target detection. Moreover, the Adaptive Multi-scale Convolution Kernel Feature Fusion Module (AKSFFM) enhances feature fusion through multi-scale convolution operations and attention weight learning mechanisms. Moreover, our proposed ARSOD-YOLO optimized the network architecture, component modules, and loss functions based on YOLOv8, enhancing outstanding small target detection capabilities while preserving model efficiency. We conducted experiments on the VEDAI and AI-TOD datasets, showcasing the excellent performance of ARSOD-YOLO. Our algorithm achieved an mAP50 of 74.3% on the VEDAI dataset, surpassing the YOLOv8 baseline by 3.1%. Similarly, on the AI-TOD dataset, the mAP50 reached 47.8%, exceeding the baseline network by 6.1%.

摘要

遥感图像在环境监测、农业和自动驾驶等领域发挥着至关重要的作用。然而,遥感图像中的目标检测仍然是一项具有挑战性的任务。本研究引入了创新方法来增强特征提取、特征融合和模型优化。自适应选择性特征增强模块(AFEM)使用GhostModule和sigmoid函数动态调整特征权重,从而提高小目标检测的准确性。此外,自适应多尺度卷积核特征融合模块(AKSFFM)通过多尺度卷积操作和注意力权重学习机制增强特征融合。此外,我们提出的ARSOD-YOLO基于YOLOv8对网络架构、组件模块和损失函数进行了优化,在保持模型效率的同时增强了出色的小目标检测能力。我们在VEDAI和AI-TOD数据集上进行了实验,展示了ARSOD-YOLO的优异性能。我们的算法在VEDAI数据集上的mAP50达到了74.3%,比YOLOv8基线高出3.1%。同样,在AI-TOD数据集上,mAP50达到了47.8%,超过基线网络6.1%。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab85/11644057/38d3b9cdbcc8/sensors-24-07472-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验