• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于CPDD-YOLOv8的航空图像小目标检测模型

A small object detection model in aerial images based on CPDD-YOLOv8.

作者信息

Wang Jingyang, Gao Jiayao, Zhang Bo

机构信息

School of Information Science and Engineering, Hebei University of Science and Technology, Shijiazhuang, 050018, China.

School of Cyberspace Security, Hebei University of Engineering Science, Shijiazhuang, 050091, China.

出版信息

Sci Rep. 2025 Jan 4;15(1):770. doi: 10.1038/s41598-024-84938-4.

DOI:10.1038/s41598-024-84938-4
PMID:39755781
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11700085/
Abstract

Aerial images can cover a wide area and capture rich scene information. These images are often taken from a high altitude and contain many small objects. It is difficult to detect small objects accurately because their features are not obvious and are susceptible to background interference. The CPDD-YOLOv8 is proposed to improve the performance of small object detection. Firstly, we propose the C2fGAM structure, which integrates the Global Attention Mechanism (GAM) into the C2f structure of the backbone so that the model can better understand the overall semantics of the images. Secondly, a detection layer named P2 is added to extract the shallow features. Thirdly, a new DSC2f structure is proposed, which uses Dynamic Snake Convolution (DSConv) to take the place of the first standard Conv of Bottleneck in the C2f structure, so that the model can adapt to different inputs more effectively. Finally, the Dynamic Head (DyHead), which integrates multiple attention mechanisms, is used in the head to assign different weights to different feature layers. To prove the effectiveness of the CPDD-YOLOv8, we carry out ablation and comparison experiments on the VisDrone2019 dataset. Ablation experiments show that all the improved and added modules in CPDD-YOLOv8 are effective. Comparative experiments suggest that the mAP of CPDD-YOLOv8 is higher than the other seven comparison models. The mAP@0.5 of this model reaches 41%, which is 6.9% higher than that of YOLOv8. The CPDD-YOLOv8's small object detection rate is improved by 13.1%. The generalizability of the CPDD-YOLOv8 model is verified on the WiderPerson, VOC_MASK and SHWD datasets.

摘要

航空图像可以覆盖广阔区域并捕捉丰富的场景信息。这些图像通常是从高空拍摄的,包含许多小物体。由于小物体的特征不明显且易受背景干扰,因此很难准确检测到它们。提出CPDD-YOLOv8以提高小物体检测的性能。首先,我们提出了C2fGAM结构,将全局注意力机制(GAM)集成到主干的C2f结构中,以便模型能够更好地理解图像的整体语义。其次,添加了一个名为P2的检测层来提取浅层特征。第三,提出了一种新的DSC2f结构,它使用动态蛇形卷积(DSConv)代替C2f结构中瓶颈的第一个标准卷积,从而使模型能够更有效地适应不同的输入。最后,在头部使用集成了多种注意力机制的动态头部(DyHead),为不同的特征层分配不同的权重。为了证明CPDD-YOLOv8的有效性,我们在VisDrone2019数据集上进行了消融和对比实验。消融实验表明,CPDD-YOLOv8中所有改进和添加的模块都是有效的。对比实验表明,CPDD-YOLOv8的平均精度均值(mAP)高于其他七个对比模型。该模型的mAP@0.5达到41%,比YOLOv8高6.9%。CPDD-YOLOv8的小物体检测率提高了13.1%。在WiderPerson、VOC_MASK和SHWD数据集上验证了CPDD-YOLOv8模型的通用性。

相似文献

1
A small object detection model in aerial images based on CPDD-YOLOv8.基于CPDD-YOLOv8的航空图像小目标检测模型
Sci Rep. 2025 Jan 4;15(1):770. doi: 10.1038/s41598-024-84938-4.
2
SOD-YOLOv8-Enhancing YOLOv8 for Small Object Detection in Aerial Imagery and Traffic Scenes.SOD-YOLOv8——增强YOLOv8以用于航空图像和交通场景中的小目标检测
Sensors (Basel). 2024 Sep 25;24(19):6209. doi: 10.3390/s24196209.
3
HP-YOLOv8: High-Precision Small Object Detection Algorithm for Remote Sensing Images.HP-YOLOv8:用于遥感图像的高精度小目标检测算法
Sensors (Basel). 2024 Jul 26;24(15):4858. doi: 10.3390/s24154858.
4
UAV-YOLOv8: A Small-Object-Detection Model Based on Improved YOLOv8 for UAV Aerial Photography Scenarios.无人机 - YOLOv8:一种基于改进YOLOv8的用于无人机航拍场景的小目标检测模型。
Sensors (Basel). 2023 Aug 15;23(16):7190. doi: 10.3390/s23167190.
5
OBC-YOLOv8: an improved road damage detection model based on YOLOv8.OBC-YOLOv8:一种基于YOLOv8的改进型道路损伤检测模型。
PeerJ Comput Sci. 2025 Jan 7;11:e2593. doi: 10.7717/peerj-cs.2593. eCollection 2025.
6
YOLOv8-MPEB small target detection algorithm based on UAV images.基于无人机图像的YOLOv8 - MPEB小目标检测算法
Heliyon. 2024 Apr 15;10(8):e29501. doi: 10.1016/j.heliyon.2024.e29501. eCollection 2024 Apr 30.
7
SRE-YOLOv8: An Improved UAV Object Detection Model Utilizing Swin Transformer and RE-FPN.SRE-YOLOv8:一种利用Swin Transformer和RE-FPN的改进型无人机目标检测模型。
Sensors (Basel). 2024 Jun 17;24(12):3918. doi: 10.3390/s24123918.
8
YOLOv5_mamba: unmanned aerial vehicle object detection based on bidirectional dense feedback network and adaptive gate feature fusion.YOLOv5_mamba:基于双向密集反馈网络和自适应门特征融合的无人机目标检测
Sci Rep. 2024 Sep 27;14(1):22396. doi: 10.1038/s41598-024-73241-x.
9
MST-YOLO: Small Object Detection Model for Autonomous Driving.MST-YOLO:用于自动驾驶的小目标检测模型
Sensors (Basel). 2024 Nov 18;24(22):7347. doi: 10.3390/s24227347.
10
A lightweight wheat ear counting model in UAV images based on improved YOLOv8.一种基于改进YOLOv8的无人机图像中小麦穗计数轻量级模型。
Front Plant Sci. 2025 Feb 11;16:1536017. doi: 10.3389/fpls.2025.1536017. eCollection 2025.

引用本文的文献

1
LRDS-YOLO enhances small object detection in UAV aerial images with a lightweight and efficient design.LRDS-YOLO通过轻量级且高效的设计增强了无人机航拍图像中的小目标检测能力。
Sci Rep. 2025 Jul 2;15(1):22627. doi: 10.1038/s41598-025-07021-6.
2
Enhancing wind turbine blade damage detection with YOLO-Wind.利用YOLO-Wind增强风力涡轮机叶片损伤检测
Sci Rep. 2025 May 28;15(1):18667. doi: 10.1038/s41598-025-03639-8.

本文引用的文献

1
Student Behavior Detection in the Classroom Based on Improved YOLOv8.基于改进YOLOv8的课堂学生行为检测
Sensors (Basel). 2023 Oct 11;23(20):8385. doi: 10.3390/s23208385.
2
LPO-YOLOv5s: A Lightweight Pouring Robot Object Detection Algorithm.LPO-YOLOv5s:一种轻量级浇注机器人目标检测算法。
Sensors (Basel). 2023 Jul 14;23(14):6399. doi: 10.3390/s23146399.
3
Rebalanced Zero-Shot Learning.重新平衡的零样本学习
IEEE Trans Image Process. 2023;32:4185-4198. doi: 10.1109/TIP.2023.3295738. Epub 2023 Jul 25.
4
Detection and Tracking Meet Drones Challenge.检测与跟踪遭遇无人机挑战。
IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):7380-7399. doi: 10.1109/TPAMI.2021.3119563. Epub 2022 Oct 4.
5
Three Dimensional UAV Positioning for Dynamic UAV-to-Car Communications.三维无人机定位在动态无人机到车通信中的应用。
Sensors (Basel). 2020 Jan 8;20(2):356. doi: 10.3390/s20020356.