• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 NAS 门控卷积模块和胶囊注意力模块的目标探测器。

Object detectors involving a NAS-gate convolutional module and capsule attention module.

机构信息

Division of Mechanical and Biomedical Engineering, Graduate Program in System Health Science and Engineering, Ewha Womans University, Seoul, 03760, Republic of Korea.

出版信息

Sci Rep. 2022 Mar 10;12(1):3916. doi: 10.1038/s41598-022-07898-7.

DOI:10.1038/s41598-022-07898-7
PMID:35273256
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8913793/
Abstract

Several state-of-the-art object detectors have demonstrated outstanding performances by optimizing feature representation through modification of the backbone architecture and exploitation of a feature pyramid. To determine the effectiveness of this approach, we explore the modification of object detectors' backbone and feature pyramid by utilizing Neural Architecture Search (NAS) and Capsule Network. We introduce two modules, namely, NAS-gate convolutional module and Capsule Attention module. The NAS-gate convolutional module optimizes standard convolution in a backbone network based on differentiable architecture search cooperation with multiple convolution conditions to overcome object scale variation problems. The Capsule Attention module exploits the strong spatial relationship encoding ability of the capsule network to generate a spatial attention mask, which emphasizes important features and suppresses unnecessary features in the feature pyramid, in order to optimize the feature representation and localization capability of the detectors. Experimental results indicate that the NAS-gate convolutional module can alleviate the object scale variation problem and the Capsule Attention network can help to avoid inaccurate localization. Next, we introduce NASGC-CapANet, which incorporates the two modules, i.e., a NAS-gate convolutional module and capsule attention module. Results of comparisons against state-of-the-art object detectors on the MS COCO val-2017 dataset demonstrate that NASGC-CapANet-based Faster R-CNN significantly outperforms the baseline Faster R-CNN with a ResNet-50 backbone and a ResNet-101 backbone by mAPs of 2.7% and 2.0%, respectively. Furthermore, the NASGC-CapANet-based Cascade R-CNN achieves a box mAP of 43.8% on the MS COCO test-dev dataset.

摘要

几种最先进的目标检测方法通过修改骨干网络结构和利用特征金字塔来优化特征表示,展示了出色的性能。为了确定这种方法的有效性,我们通过使用神经架构搜索 (NAS) 和胶囊网络来探索对目标检测器骨干和特征金字塔的修改。我们引入了两个模块,即 NAS 门控卷积模块和胶囊注意力模块。NAS 门控卷积模块基于可微分架构搜索与多个卷积条件合作,优化骨干网络中的标准卷积,以克服目标尺度变化问题。胶囊注意力模块利用胶囊网络的强大空间关系编码能力生成空间注意力掩模,该掩模在特征金字塔中强调重要特征并抑制不必要的特征,从而优化检测器的特征表示和定位能力。实验结果表明,NAS 门控卷积模块可以减轻目标尺度变化问题,胶囊注意力网络可以帮助避免定位不准确。接下来,我们引入了包含两个模块的 NASGC-CapANet,即 NAS 门控卷积模块和胶囊注意力模块。在 MS COCO val-2017 数据集上与最先进的目标检测方法进行比较的结果表明,基于 NASGC-CapANet 的 Faster R-CNN 显著优于具有 ResNet-50 骨干和 ResNet-101 骨干的基线 Faster R-CNN,mAP 分别提高了 2.7%和 2.0%。此外,基于 NASGC-CapANet 的级联 R-CNN 在 MS COCO test-dev 数据集上实现了 43.8%的框 mAP。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c59a/8913793/3725228da808/41598_2022_7898_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c59a/8913793/ab110f088b6e/41598_2022_7898_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c59a/8913793/6c6b7c8ed390/41598_2022_7898_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c59a/8913793/637a2c9b16f4/41598_2022_7898_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c59a/8913793/08389e122383/41598_2022_7898_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c59a/8913793/3725228da808/41598_2022_7898_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c59a/8913793/ab110f088b6e/41598_2022_7898_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c59a/8913793/6c6b7c8ed390/41598_2022_7898_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c59a/8913793/637a2c9b16f4/41598_2022_7898_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c59a/8913793/08389e122383/41598_2022_7898_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c59a/8913793/3725228da808/41598_2022_7898_Fig5_HTML.jpg

相似文献

1
Object detectors involving a NAS-gate convolutional module and capsule attention module.基于 NAS 门控卷积模块和胶囊注意力模块的目标探测器。
Sci Rep. 2022 Mar 10;12(1):3916. doi: 10.1038/s41598-022-07898-7.
2
ssFPN: Scale Sequence () Feature-Based Feature Pyramid Network for Object Detection.ssFPN:基于尺度序列(Scale Sequence)特征的目标检测特征金字塔网络。
Sensors (Basel). 2023 Apr 30;23(9):4432. doi: 10.3390/s23094432.
3
Inferior and Coordinate Distillation for Object Detectors.对象检测器的下推与坐标蒸馏。
Sensors (Basel). 2022 Jul 30;22(15):5719. doi: 10.3390/s22155719.
4
RGB-D salient object detection via convolutional capsule network based on feature extraction and integration.基于特征提取与融合的卷积胶囊网络实现RGB-D显著目标检测
Sci Rep. 2023 Oct 17;13(1):17652. doi: 10.1038/s41598-023-44698-z.
5
Multi-scale semantic enhancement network for object detection.用于目标检测的多尺度语义增强网络。
Sci Rep. 2023 May 3;13(1):7178. doi: 10.1038/s41598-023-34277-7.
6
Mask-Refined R-CNN: A Network for Refining Object Details in Instance Segmentation.Mask-Refined R-CNN:用于实例分割中细化对象细节的网络。
Sensors (Basel). 2020 Feb 13;20(4):1010. doi: 10.3390/s20041010.
7
DPNet: Dual-Path Network for Real-Time Object Detection With Lightweight Attention.DPNet:用于实时目标检测的带轻量级注意力机制的双路径网络
IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):4504-4518. doi: 10.1109/TNNLS.2024.3376563. Epub 2025 Feb 28.
8
MFA-net: Object detection for complex X-ray cargo and baggage security imagery.MFA-net:用于复杂 X 射线货物和行李安全成像的目标检测。
PLoS One. 2022 Sep 1;17(9):e0272961. doi: 10.1371/journal.pone.0272961. eCollection 2022.
9
Alpha-SGANet: A multi-attention-scale feature pyramid network combined with lightweight network based on Alpha-IoU loss.Alpha-SGANet:一种基于 Alpha-IoU 损失的多注意力尺度特征金字塔网络与轻量级网络相结合的方法。
PLoS One. 2022 Oct 27;17(10):e0276581. doi: 10.1371/journal.pone.0276581. eCollection 2022.
10
Single-Shot Object Detection via Feature Enhancement and Channel Attention.基于特征增强与通道注意力的单阶段目标检测
Sensors (Basel). 2022 Sep 10;22(18):6857. doi: 10.3390/s22186857.

引用本文的文献

1
Neural Architecture Search Survey: A Computer Vision Perspective.神经架构搜索综述:计算机视觉视角。
Sensors (Basel). 2023 Feb 3;23(3):1713. doi: 10.3390/s23031713.

本文引用的文献

1
Deep High-Resolution Representation Learning for Visual Recognition.用于视觉识别的深度高分辨率表征学习
IEEE Trans Pattern Anal Mach Intell. 2021 Oct;43(10):3349-3364. doi: 10.1109/TPAMI.2020.2983686. Epub 2021 Sep 2.
2
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.