• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于目标检测的可变形胶囊

Deformable Capsules for Object Detection.

作者信息

LaLonde Rodney, Khosravan Naji, Bagci Ulas

机构信息

Palantir Technologies, Washington, DC.

Zillow, Seattle, WA.

出版信息

Adv Intell Syst. 2024 Sep;6(9). doi: 10.1002/aisy.202400044. Epub 2024 Aug 22.

DOI:10.1002/aisy.202400044
PMID:39669747
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11632968/
Abstract

Capsule networks promise significant benefits over convolutional networks by storing stronger internal representations, and routing information based on the agreement between intermediate representations' projections. Despite this, their success has been limited to small-scale classification datasets due to their computationally expensive nature. Though memory efficient, convolutional capsules impose geometric constraints that fundamentally limit the ability of capsules to model the pose/deformation of objects. Further, they do not address the bigger memory concern of class-capsules scaling up to bigger tasks such as detection or large-scale classification. In this study, we introduce a new family of capsule networks, deformable capsules (), to address a very important problem in computer vision: object detection. We propose two new algorithms associated with our : a novel capsule structure (), and a novel dynamic routing algorithm (), which balance computational efficiency with the need for modeling a large number of objects and classes, which have never been achieved with capsule networks before. We demonstrate that the proposed methods efficiently scale up to create the first-ever capsule network for object detection in the literature. Our proposed architecture is a one-stage detection framework and it obtains results on MS COCO which are on par with state-of-the-art one-stage CNN-based methods, while producing fewer false positive detection, generalizing to unusual poses/viewpoints of objects.

摘要

胶囊网络通过存储更强的内部表示,并基于中间表示投影之间的一致性来路由信息,有望比卷积网络带来显著优势。尽管如此,由于其计算成本高昂,它们的成功仅限于小规模分类数据集。卷积胶囊虽然内存效率高,但施加了几何约束,从根本上限制了胶囊对物体姿态/变形进行建模的能力。此外,它们没有解决类胶囊扩展到诸如检测或大规模分类等更大任务时更大的内存问题。在本研究中,我们引入了一个新的胶囊网络家族,即可变形胶囊(),以解决计算机视觉中一个非常重要的问题:目标检测。我们提出了与我们的相关的两种新算法:一种新颖的胶囊结构()和一种新颖的动态路由算法(),它们在计算效率与对大量物体和类进行建模的需求之间取得平衡,这是胶囊网络以前从未实现过的。我们证明,所提出的方法能够有效地扩展,从而创建了文献中首个用于目标检测的胶囊网络。我们提出的架构是一个单阶段检测框架,它在MS COCO数据集上获得的结果与基于最先进的单阶段卷积神经网络的方法相当,同时产生更少的误报检测,能够推广到物体的异常姿态/视角。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8527/11632968/b8bf436f37c7/nihms-2015393-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8527/11632968/45e5457dd101/nihms-2015393-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8527/11632968/74797ba02a74/nihms-2015393-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8527/11632968/a745a0caa015/nihms-2015393-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8527/11632968/092e12bf5fd9/nihms-2015393-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8527/11632968/9b54760df1df/nihms-2015393-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8527/11632968/22ae977892cd/nihms-2015393-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8527/11632968/b8bf436f37c7/nihms-2015393-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8527/11632968/45e5457dd101/nihms-2015393-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8527/11632968/74797ba02a74/nihms-2015393-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8527/11632968/a745a0caa015/nihms-2015393-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8527/11632968/092e12bf5fd9/nihms-2015393-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8527/11632968/9b54760df1df/nihms-2015393-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8527/11632968/22ae977892cd/nihms-2015393-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8527/11632968/b8bf436f37c7/nihms-2015393-f0007.jpg

相似文献

1
Deformable Capsules for Object Detection.用于目标检测的可变形胶囊
Adv Intell Syst. 2024 Sep;6(9). doi: 10.1002/aisy.202400044. Epub 2024 Aug 22.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Short-Term Memory Impairment短期记忆障碍
4
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
5
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
6
Sexual Harassment and Prevention Training性骚扰与预防培训
7
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
8
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状荟萃分析。
Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.
9
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
10
Incentives for preventing smoking in children and adolescents.预防儿童和青少年吸烟的激励措施。
Cochrane Database Syst Rev. 2017 Jun 6;6(6):CD008645. doi: 10.1002/14651858.CD008645.pub3.

本文引用的文献

1
Rethinking Portrait Matting with Privacy Preserving.基于隐私保护的人像抠图再思考。
Int J Comput Vis. 2023 May 20:1-26. doi: 10.1007/s11263-023-01797-8.