Dual-Resolution Dual-Path Convolutional Neural Networks for Fast Object Detection.

Authors

Pan Jing, Sun Hanqing, Song Zhanjie, Han Jungong

Affiliations

School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China.

School of Mathematics, Tianjin University, Tianjin 300072, China.

Publication information

Sensors (Basel). 2019 Jul 14;19(14):3111. doi: 10.3390/s19143111.

PMID: 31337121

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6679249/

Abstract

Downsampling input images is a simple way to speed up visual object-detection algorithms, especially in robotic vision and mobile vision applications. However, it comes with a significant drop in accuracy. In this paper, dual-resolution dual-path Convolutional Neural Networks (CNNs), named DualNets, are proposed to improve the accuracy of such detection applications. In contrast to previous methods that simply downsample the input images, DualNets explicitly take two inputs at different resolutions and extract complementary visual features from them using two CNN paths. The two paths in a DualNet are a backbone path and an auxiliary path; the auxiliary path accepts a larger input and rapidly downsamples it to relatively small feature maps. With these carefully designed auxiliary paths, auxiliary features are extracted from the larger input at a controllable computational cost. The auxiliary features are then fused with the backbone features using a proposed progressive residual fusion strategy to enrich the feature representation. This architecture, as the feature extractor, is further integrated with the Single Shot Detector (SSD) to accomplish latency-sensitive visual object-detection tasks. We evaluate the resulting detection pipeline on the Pascal VOC and MS COCO benchmarks. Results show that the proposed DualNets can raise the accuracy of CNN detection applications that are sensitive to computational load.
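The dual-path idea can be illustrated with some shape bookkeeping. The sketch below is not the authors' implementation: the input sizes (160 and 320 pixels), the stride schedules, the 3x3 kernel with padding 1, and the helper names `conv_out_size`, `path_output_size`, and `residual_fuse` are all assumptions chosen for illustration. A plain element-wise addition stands in for the progressive residual fusion, whose details the abstract does not give. It only shows how an auxiliary path with a more aggressive first stride can bring a larger input down to the same feature-map size as the backbone, so that the two can be added.

```python
# Illustrative sketch only. Sizes, strides, and the fusion rule are
# assumptions, not the configuration used in the paper.

def conv_out_size(size, kernel, stride, padding):
    """Spatial output size of a convolution (standard floor formula)."""
    return (size + 2 * padding - kernel) // stride + 1


def path_output_size(input_size, strides, kernel=3, padding=1):
    """Feature-map size after a stack of convolutions with the given strides."""
    size = input_size
    for stride in strides:
        size = conv_out_size(size, kernel, stride, padding)
    return size


def residual_fuse(backbone, auxiliary):
    """Element-wise sum of two equally shaped 2-D maps (nested lists).

    Stand-in for the paper's fusion step: the auxiliary features are
    added to the backbone features as a residual.
    """
    same_rows = len(backbone) == len(auxiliary)
    if not same_rows or any(len(b) != len(a) for b, a in zip(backbone, auxiliary)):
        raise ValueError("feature maps must have the same shape to be fused")
    return [[b + a for b, a in zip(b_row, a_row)]
            for b_row, a_row in zip(backbone, auxiliary)]


# Hypothetical backbone path: small input, three stride-2 stages.
backbone_size = path_output_size(160, strides=[2, 2, 2])

# Hypothetical auxiliary path: input twice as large, stride-4 first stage,
# so the extra resolution is reduced early and cheaply.
auxiliary_size = path_output_size(320, strides=[4, 2, 2])

# Both paths end at the same spatial size, so their features can be fused.
fused = residual_fuse([[1, 2], [3, 4]], [[10, 20], [30, 40]])
```

With these assumed settings both paths end at a 20x20 map. In a real network the fused tensors would also need matching channel counts, which this sketch ignores.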

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/17ea/6679249/a9fc34a31f80/sensors-19-03111-g001.jpg
