• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DPNet:用于实时目标检测的带轻量级注意力机制的双路径网络

DPNet: Dual-Path Network for Real-Time Object Detection With Lightweight Attention.

作者信息

Zhou Quan, Shi Huimin, Xiang Weikang, Kang Bin, Latecki Longin Jan

出版信息

IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):4504-4518. doi: 10.1109/TNNLS.2024.3376563. Epub 2025 Feb 28.

DOI:10.1109/TNNLS.2024.3376563
PMID:38536700
Abstract

The recent advances in compressing high-accuracy convolutional neural networks (CNNs) have witnessed remarkable progress in real-time object detection. To accelerate detection speed, lightweight detectors always have few convolution layers using a single-path backbone. Single-path architecture, however, involves continuous pooling and downsampling operations, always resulting in coarse and inaccurate feature maps that are disadvantageous to locate objects. On the other hand, due to limited network capacity, recent lightweight networks are often weak in representing large-scale visual data. To address these problems, we present a dual-path network, named DPNet, with a lightweight attention scheme for real-time object detection. The dual-path architecture enables us to extract in parallel high-level semantic features and low-level object details. Although DPNet has a nearly duplicated shape with respect to single-path detectors, the computational costs and model size are not significantly increased. To enhance representation capability, a lightweight self-correlation module (LSCM) is designed to capture global interactions, with only a few computational overheads and network parameters. In the neck, LSCM is extended into a lightweight cross correlation module (LCCM), capturing mutual dependencies among neighboring scale features. We have conducted exhaustive experiments on MS COCO, Pascal VOC 2007, and ImageNet datasets. The experimental results demonstrate that DPNet achieves a state-of-the-art trade off between detection accuracy and implementation efficiency. More specifically, DPNet achieves 31.3% AP on MS COCO test-dev, 82.7% mAP on Pascal VOC 2007 test set, and 41.6% mAP on ImageNet validation set, together with nearly 2.5M model size, 1.04 GFLOPs, and 164 and 196 frames/s (FPS) FPS for input images of three datasets.

摘要

在压缩高精度卷积神经网络(CNN)方面的最新进展在实时目标检测中取得了显著进展。为了加快检测速度,轻量级检测器通常使用单路径主干,卷积层较少。然而,单路径架构涉及连续的池化和下采样操作,总是会产生粗糙且不准确的特征图,不利于目标定位。另一方面,由于网络容量有限,最近的轻量级网络在表示大规模视觉数据方面往往较弱。为了解决这些问题,我们提出了一种双路径网络,名为DPNet,它具有用于实时目标检测的轻量级注意力机制。双路径架构使我们能够并行提取高级语义特征和低级目标细节。虽然DPNet相对于单路径检测器具有几乎相同的形状,但计算成本和模型大小并没有显著增加。为了增强表示能力,设计了一种轻量级自相关模块(LSCM)来捕获全局交互,只需要很少的计算开销和网络参数。在颈部,LSCM扩展为轻量级交叉相关模块(LCCM),捕获相邻尺度特征之间的相互依赖关系。我们在MS COCO、Pascal VOC 2007和ImageNet数据集上进行了详尽的实验。实验结果表明,DPNet在检测精度和实现效率之间实现了最优平衡。具体而言,DPNet在MS COCO测试开发集上达到31.3%的平均精度(AP),在Pascal VOC 2007测试集上达到82.7%的平均精度均值(mAP),在ImageNet验证集上达到41.6%的平均精度均值(mAP),同时模型大小接近250万,浮点运算次数(GFLOPs)为1.04,对于三个数据集的输入图像,帧率分别为164帧/秒和196帧/秒(FPS)。

相似文献

1
DPNet: Dual-Path Network for Real-Time Object Detection With Lightweight Attention.DPNet:用于实时目标检测的带轻量级注意力机制的双路径网络
IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):4504-4518. doi: 10.1109/TNNLS.2024.3376563. Epub 2025 Feb 28.
2
Dual-Resolution Dual-Path Convolutional Neural Networks for Fast Object Detection.用于快速目标检测的双分辨率双路径卷积神经网络
Sensors (Basel). 2019 Jul 14;19(14):3111. doi: 10.3390/s19143111.
3
A new multi-scale backbone network for object detection based on asymmetric convolutions.基于非对称卷积的目标检测新的多尺度骨干网络。
Sci Prog. 2021 Apr-Jun;104(2):368504211011343. doi: 10.1177/00368504211011343.
4
Lightweight Feature Enhancement Network for Single-Shot Object Detection.轻量级特征增强网络用于单目标检测。
Sensors (Basel). 2021 Feb 4;21(4):1066. doi: 10.3390/s21041066.
5
Lightweight multi-scale network for small object detection.用于小目标检测的轻量级多尺度网络。
PeerJ Comput Sci. 2022 Nov 8;8:e1145. doi: 10.7717/peerj-cs.1145. eCollection 2022.
6
Object detectors involving a NAS-gate convolutional module and capsule attention module.基于 NAS 门控卷积模块和胶囊注意力模块的目标探测器。
Sci Rep. 2022 Mar 10;12(1):3916. doi: 10.1038/s41598-022-07898-7.
7
Alpha-SGANet: A multi-attention-scale feature pyramid network combined with lightweight network based on Alpha-IoU loss.Alpha-SGANet:一种基于 Alpha-IoU 损失的多注意力尺度特征金字塔网络与轻量级网络相结合的方法。
PLoS One. 2022 Oct 27;17(10):e0276581. doi: 10.1371/journal.pone.0276581. eCollection 2022.
8
DPSSD: Dual-Path Single-Shot Detector.DPSSD:双路径单发探测器。
Sensors (Basel). 2022 Jun 18;22(12):4616. doi: 10.3390/s22124616.
9
[An efficient and lightweight skin pathology detection method based on multi-scale feature fusion using an improved RT-DETR model].基于改进的RT-DETR模型多尺度特征融合的高效轻量级皮肤病理学检测方法
Nan Fang Yi Ke Da Xue Xue Bao. 2025 Feb 20;45(2):409-421. doi: 10.12122/j.issn.1673-4254.2025.02.22.
10
Faster SCDNet: Real-Time Semantic Segmentation Network with Split Connection and Flexible Dilated Convolution.更快的 SCDNet:具有分割连接和灵活空洞卷积的实时语义分割网络。
Sensors (Basel). 2023 Mar 14;23(6):3112. doi: 10.3390/s23063112.

引用本文的文献

1
Deep neural networks for automated damage classification in image-based visual data of reinforced concrete structures.用于钢筋混凝土结构基于图像的视觉数据中损伤自动分类的深度神经网络。
Heliyon. 2024 Sep 19;10(19):e38104. doi: 10.1016/j.heliyon.2024.e38104. eCollection 2024 Oct 15.
2
A Lightweight and Efficient Multi-Type Defect Detection Method for Transmission Lines Based on DCP-YOLOv8.一种基于DCP-YOLOv8的轻量级高效输电线路多类型缺陷检测方法
Sensors (Basel). 2024 Jul 11;24(14):4491. doi: 10.3390/s24144491.