• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于目标检测的集中式特征金字塔

Centralized Feature Pyramid for Object Detection.

作者信息

Quan Yu, Zhang Dong, Zhang Liyan, Tang Jinhui

出版信息

IEEE Trans Image Process. 2023;32:4341-4354. doi: 10.1109/TIP.2023.3297408. Epub 2023 Aug 2.

DOI:10.1109/TIP.2023.3297408
PMID:37490376
Abstract

The visual feature pyramid has shown its superiority in both effectiveness and efficiency in a variety of applications. However, current methods overly focus on inter-layer feature interactions while disregarding the importance of intra-layer feature regulation. Despite some attempts to learn a compact intra-layer feature representation with the use of attention mechanisms or vision transformers, they overlook the crucial corner regions that are essential for dense prediction tasks. To address this problem, we propose a Centralized Feature Pyramid (CFP) network for object detection, which is based on a globally explicit centralized feature regulation. Specifically, we first propose a spatial explicit visual center scheme, where a lightweight MLP is used to capture the globally long-range dependencies, and a parallel learnable visual center mechanism is used to capture the local corner regions of the input images. Based on this, we then propose a globally centralized regulation for the commonly-used feature pyramid in a top-down fashion, where the explicit visual center information obtained from the deepest intra-layer feature is used to regulate frontal shallow features. Compared to the existing feature pyramids, CFP not only has the ability to capture the global long-range dependencies but also efficiently obtain an all-round yet discriminative feature representation. Experimental results on the challenging MS-COCO validate that our proposed CFP can achieve consistent performance gains on the state-of-the-art YOLOv5 and YOLOX object detection baselines.

摘要

视觉特征金字塔在各种应用中已展现出其在有效性和效率方面的优势。然而,当前方法过度关注层间特征交互,却忽视了层内特征调节的重要性。尽管有一些尝试通过注意力机制或视觉变换器来学习紧凑的层内特征表示,但它们忽略了对密集预测任务至关重要的关键角落区域。为了解决这个问题,我们提出了一种用于目标检测的集中式特征金字塔(CFP)网络,它基于全局显式的集中式特征调节。具体而言,我们首先提出一种空间显式视觉中心方案,其中使用轻量级多层感知器来捕获全局长程依赖,并使用并行可学习视觉中心机制来捕获输入图像的局部角落区域。基于此,我们随后以自上而下的方式对常用特征金字塔提出全局集中调节,其中从最深层内特征获得的显式视觉中心信息用于调节前面的浅层特征。与现有特征金字塔相比,CFP不仅具有捕获全局长程依赖的能力,还能有效地获得全面且有区分性的特征表示。在具有挑战性的MS-COCO上的实验结果验证了我们提出的CFP能够在最先进的YOLOv5和YOLOX目标检测基准上实现一致的性能提升。

相似文献

1
Centralized Feature Pyramid for Object Detection.用于目标检测的集中式特征金字塔
IEEE Trans Image Process. 2023;32:4341-4354. doi: 10.1109/TIP.2023.3297408. Epub 2023 Aug 2.
2
Attentional feature pyramid network for small object detection.注意特征金字塔网络用于小目标检测。
Neural Netw. 2022 Nov;155:439-450. doi: 10.1016/j.neunet.2022.08.029. Epub 2022 Sep 5.
3
Feature Pyramid Reconfiguration with Consistent Loss for Object Detection.用于目标检测的具有一致损失的特征金字塔重构
IEEE Trans Image Process. 2019 May 24. doi: 10.1109/TIP.2019.2917781.
4
Foreground Capture Feature Pyramid Network-Oriented Object Detection in Complex Backgrounds.面向复杂背景中对象检测的前景捕捉特征金字塔网络
IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):6925-6939. doi: 10.1109/TNNLS.2024.3387282. Epub 2025 Apr 4.
5
SF-YOLOv5: A Lightweight Small Object Detection Algorithm Based on Improved Feature Fusion Mode.SF-YOLOv5:一种基于改进特征融合模式的轻量级小目标检测算法。
Sensors (Basel). 2022 Aug 4;22(15):5817. doi: 10.3390/s22155817.
6
HA-FPN: Hierarchical Attention Feature Pyramid Network for Object Detection.HA-FPN:用于目标检测的层次注意特征金字塔网络。
Sensors (Basel). 2023 May 5;23(9):4508. doi: 10.3390/s23094508.
7
Scale Enhancement Pyramid Network for Small Object Detection from UAV Images.用于无人机图像中小目标检测的尺度增强金字塔网络
Entropy (Basel). 2022 Nov 21;24(11):1699. doi: 10.3390/e24111699.
8
Conformer: Local Features Coupling Global Representations for Recognition and Detection.构象:用于识别和检测的局部特征与全局表示相结合。
IEEE Trans Pattern Anal Mach Intell. 2023 Aug;45(8):9454-9468. doi: 10.1109/TPAMI.2023.3243048. Epub 2023 Jun 30.
9
Robotic Grasp Detection Network Based on Improved Deformable Convolution and Spatial Feature Center Mechanism.基于改进可变形卷积和空间特征中心机制的机器人抓取检测网络
Biomimetics (Basel). 2023 Sep 1;8(5):403. doi: 10.3390/biomimetics8050403.
10
Chip Pad Inspection Method Based on an Improved YOLOv5 Algorithm.基于改进 YOLOv5 算法的芯片焊点检测方法
Sensors (Basel). 2022 Sep 4;22(17):6685. doi: 10.3390/s22176685.

引用本文的文献

1
AITP-YOLO: improved tomato ripeness detection model based on multiple strategies.AITP-YOLO:基于多种策略的改进型番茄成熟度检测模型。
Front Plant Sci. 2025 May 26;16:1596739. doi: 10.3389/fpls.2025.1596739. eCollection 2025.
2
Research on target detection based on improved YOLOv7 in complex traffic scenarios.基于改进YOLOv7的复杂交通场景目标检测研究
PLoS One. 2025 May 19;20(5):e0323410. doi: 10.1371/journal.pone.0323410. eCollection 2025.
3
Small-Target Detection Algorithm Based on STDA-YOLOv8.基于STDA-YOLOv8的小目标检测算法
Sensors (Basel). 2025 Apr 30;25(9):2861. doi: 10.3390/s25092861.
4
EnSLDe: an enhanced short-range and long-range dependent system for brain tumor classification.EnSLDe:一种用于脑肿瘤分类的增强型短程和长程相关系统。
Front Oncol. 2025 Apr 11;15:1512739. doi: 10.3389/fonc.2025.1512739. eCollection 2025.
5
A non-invasive diagnostic approach for neuroblastoma utilizing preoperative enhanced computed tomography and deep learning techniques.一种利用术前增强计算机断层扫描和深度学习技术的神经母细胞瘤无创诊断方法。
Sci Rep. 2025 Apr 26;15(1):14652. doi: 10.1038/s41598-025-99451-5.
6
Object detection model design for tiny road surface damage.微小路面损伤的目标检测模型设计
Sci Rep. 2025 Apr 1;15(1):11032. doi: 10.1038/s41598-025-95502-z.
7
Detection of Flexible Pavement Surface Cracks in Coastal Regions Using Deep Learning and 2D/3D Images.利用深度学习和二维/三维图像检测沿海地区柔性路面表面裂缝
Sensors (Basel). 2025 Feb 13;25(4):1145. doi: 10.3390/s25041145.
8
Pepper-YOLO: an lightweight model for green pepper detection and picking point localization in complex environments.辣椒-YOLO:一种用于复杂环境中青椒检测与采摘点定位的轻量级模型。
Front Plant Sci. 2024 Dec 31;15:1508258. doi: 10.3389/fpls.2024.1508258. eCollection 2024.
9
DINOV2-FCS: a model for fruit leaf disease classification and severity prediction.DINOV2-FCS:一种用于果树叶部病害分类和严重程度预测的模型。
Front Plant Sci. 2024 Dec 6;15:1475282. doi: 10.3389/fpls.2024.1475282. eCollection 2024.
10
BGF-YOLOv10: Small Object Detection Algorithm from Unmanned Aerial Vehicle Perspective Based on Improved YOLOv10.BGF-YOLOv10:基于改进YOLOv10的无人机视角小目标检测算法
Sensors (Basel). 2024 Oct 28;24(21):6911. doi: 10.3390/s24216911.