• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

高效胶囊网络:具有自注意力路由的胶囊网络。

Efficient-CapsNet: capsule network with self-attention routing.

作者信息

Mazzia Vittorio, Salvetti Francesco, Chiaberge Marcello

机构信息

Department of Electronics and Telecommunications, Politecnico di Torino, 10129, Turin, Italy.

PIC4SeR, Politecnico di Torino Interdepartmental Centre for Service Robotics, Turin, Italy.

出版信息

Sci Rep. 2021 Jul 19;11(1):14634. doi: 10.1038/s41598-021-93977-0.

DOI:10.1038/s41598-021-93977-0
PMID:34282164
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8290018/
Abstract

Deep convolutional neural networks, assisted by architectural design strategies, make extensive use of data augmentation techniques and layers with a high number of feature maps to embed object transformations. That is highly inefficient and for large datasets implies a massive redundancy of features detectors. Even though capsules networks are still in their infancy, they constitute a promising solution to extend current convolutional networks and endow artificial visual perception with a process to encode more efficiently all feature affine transformations. Indeed, a properly working capsule network should theoretically achieve higher results with a considerably lower number of parameters count due to intrinsic capability to generalize to novel viewpoints. Nevertheless, little attention has been given to this relevant aspect. In this paper, we investigate the efficiency of capsule networks and, pushing their capacity to the limits with an extreme architecture with barely 160 K parameters, we prove that the proposed architecture is still able to achieve state-of-the-art results on three different datasets with only 2% of the original CapsNet parameters. Moreover, we replace dynamic routing with a novel non-iterative, highly parallelizable routing algorithm that can easily cope with a reduced number of capsules. Extensive experimentation with other capsule implementations has proved the effectiveness of our methodology and the capability of capsule networks to efficiently embed visual representations more prone to generalization.

摘要

深度卷积神经网络在架构设计策略的辅助下,大量使用数据增强技术以及具有大量特征图的层来嵌入对象变换。这效率极低,对于大型数据集而言意味着特征检测器存在大量冗余。尽管胶囊网络仍处于起步阶段,但它们是扩展当前卷积网络并赋予人工视觉感知一种更高效地编码所有特征仿射变换的过程的一个有前途的解决方案。实际上,一个正常工作的胶囊网络理论上应该能够以相当少的参数数量取得更高的结果,这归因于其泛化到新视角的内在能力。然而,这个相关方面几乎没有受到关注。在本文中,我们研究了胶囊网络的效率,并通过一个仅有约16万个参数的极端架构将其能力推向极限,我们证明所提出的架构仅用原始胶囊网络(CapsNet)2%的参数就能在三个不同数据集上取得当前最优的结果。此外,我们用一种新颖的非迭代、高度可并行化的路由算法取代动态路由,该算法能够轻松应对减少的胶囊数量。对其他胶囊实现方式的大量实验证明了我们方法的有效性以及胶囊网络有效嵌入更易于泛化视觉表示的能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80db/8290018/fbca40ae1758/41598_2021_93977_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80db/8290018/32164b372b83/41598_2021_93977_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80db/8290018/1a2530d2c4cc/41598_2021_93977_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80db/8290018/1ea2fbcfbbec/41598_2021_93977_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80db/8290018/a5abbf2f1103/41598_2021_93977_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80db/8290018/80445bf03a89/41598_2021_93977_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80db/8290018/af70af932565/41598_2021_93977_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80db/8290018/df8d4afea202/41598_2021_93977_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80db/8290018/fbca40ae1758/41598_2021_93977_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80db/8290018/32164b372b83/41598_2021_93977_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80db/8290018/1a2530d2c4cc/41598_2021_93977_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80db/8290018/1ea2fbcfbbec/41598_2021_93977_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80db/8290018/a5abbf2f1103/41598_2021_93977_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80db/8290018/80445bf03a89/41598_2021_93977_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80db/8290018/af70af932565/41598_2021_93977_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80db/8290018/df8d4afea202/41598_2021_93977_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80db/8290018/fbca40ae1758/41598_2021_93977_Fig8_HTML.jpg

相似文献

1
Efficient-CapsNet: capsule network with self-attention routing.高效胶囊网络:具有自注意力路由的胶囊网络。
Sci Rep. 2021 Jul 19;11(1):14634. doi: 10.1038/s41598-021-93977-0.
2
Mask Dynamic Routing to Combined Model of Deep Capsule Network and U-Net.面向深度胶囊网络与U-Net组合模型的掩码动态路由
IEEE Trans Neural Netw Learn Syst. 2020 Jul;31(7):2653-2664. doi: 10.1109/TNNLS.2020.2984686. Epub 2020 Apr 17.
3
Capsule networks with non-iterative cluster routing.胶囊网络的非迭代聚类路由。
Neural Netw. 2021 Nov;143:690-697. doi: 10.1016/j.neunet.2021.07.032. Epub 2021 Aug 8.
4
Feature Correlation-Steered Capsule Network for object detection.基于特征关联的胶囊网络目标检测方法
Neural Netw. 2022 Mar;147:25-41. doi: 10.1016/j.neunet.2021.12.003. Epub 2021 Dec 11.
5
CapsNet-LDA: predicting lncRNA-disease associations using attention mechanism and capsule network based on multi-view data.CapsNet-LDA:基于多视图数据,利用注意力机制和胶囊网络预测长链非编码RNA与疾病的关联
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac531.
6
CACNN: Capsule Attention Convolutional Neural Networks for 3D Object Recognition.CACNN:用于3D目标识别的胶囊注意力卷积神经网络
IEEE Trans Neural Netw Learn Syst. 2025 Mar;36(3):4091-4102. doi: 10.1109/TNNLS.2023.3326606. Epub 2025 Feb 28.
7
Graph routing between capsules.胶囊之间的图路由。
Neural Netw. 2021 Nov;143:345-354. doi: 10.1016/j.neunet.2021.06.018. Epub 2021 Jun 23.
8
TTDCapsNet: Tri Texton-Dense Capsule Network for complex and medical image recognition.TTDCapsNet:用于复杂和医学图像识别的三纹理元-密集胶囊网络。
PLoS One. 2024 Mar 15;19(3):e0300133. doi: 10.1371/journal.pone.0300133. eCollection 2024.
9
A pre-trained convolutional neural network with optimized capsule networks for chest X-rays COVID-19 diagnosis.一种用于胸部X光新冠病毒肺炎诊断的、带有优化胶囊网络的预训练卷积神经网络。
Cluster Comput. 2023;26(2):1389-1403. doi: 10.1007/s10586-022-03703-2. Epub 2022 Aug 23.
10
Fluorescence microscopy image classification of 2D HeLa cells based on the CapsNet neural network.基于 CapsNet 神经网络的 2D HeLa 细胞荧光显微镜图像分类。
Med Biol Eng Comput. 2019 Jun;57(6):1187-1198. doi: 10.1007/s11517-018-01946-z. Epub 2019 Jan 28.

引用本文的文献

1
Automated Detection of Epileptic Seizures in EEG Signals via Micro-Capsule Networks.通过微胶囊网络自动检测脑电图信号中的癫痫发作
Brain Sci. 2025 Aug 7;15(8):842. doi: 10.3390/brainsci15080842.
2
Reg2ST: recognizing potential patterns from gene expression for spatial transcriptomics prediction.Reg2ST:从基因表达中识别潜在模式以进行空间转录组学预测。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf425.
3
Parametric matrix models.参数矩阵模型

本文引用的文献

1
DA-CapsNet: dual attention mechanism capsule network.DA-CapsNet:双注意力机制胶囊网络。
Sci Rep. 2020 Jul 9;10(1):11383. doi: 10.1038/s41598-020-68453-w.
2
Automated Classification of Apoptosis in Phase Contrast Microscopy Using Capsule Network.基于胶囊网络的相差显微镜下细胞凋亡的自动分类。
IEEE Trans Med Imaging. 2020 Jan;39(1):1-10. doi: 10.1109/TMI.2019.2918181. Epub 2019 May 22.
Nat Commun. 2025 Jul 1;16(1):5929. doi: 10.1038/s41467-025-61362-4.
4
Combining spatial transcriptomics with tissue morphology.将空间转录组学与组织形态学相结合。
Nat Commun. 2025 May 13;16(1):4452. doi: 10.1038/s41467-025-58989-8.
5
Capsule neural network and its applications in drug discovery.胶囊神经网络及其在药物发现中的应用。
iScience. 2025 Mar 14;28(4):112217. doi: 10.1016/j.isci.2025.112217. eCollection 2025 Apr 18.
6
Interpretable capsule networks via self attention routing on spatially invariant feature surfaces.通过在空间不变特征表面上的自注意力路由实现可解释的胶囊网络。
Sci Rep. 2025 Apr 15;15(1):13026. doi: 10.1038/s41598-025-96903-w.
7
Integrating spatial transcriptomics and bulk RNA-seq: predicting gene expression with enhanced resolution through graph attention networks.整合空间转录组学和批量 RNA-seq:通过图注意网络提高分辨率预测基因表达。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae316.
8
Attention-gated 3D CapsNet for robust hippocampal segmentation.用于稳健海马体分割的注意力门控3D胶囊网络
J Med Imaging (Bellingham). 2024 Jan;11(1):014003. doi: 10.1117/1.JMI.11.1.014003. Epub 2024 Jan 2.
9
THItoGene: a deep learning method for predicting spatial transcriptomics from histological images.THItoGene:一种从组织学图像预测空间转录组学的深度学习方法。
Brief Bioinform. 2023 Nov 22;25(1). doi: 10.1093/bib/bbad464.
10
DCAMCP: A deep learning model based on capsule network and attention mechanism for molecular carcinogenicity prediction.DCAMCP:一种基于胶囊网络和注意力机制的深度学习模型,用于分子致癌性预测。
J Cell Mol Med. 2023 Oct;27(20):3117-3126. doi: 10.1111/jcmm.17889. Epub 2023 Jul 31.