• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

迈向深度学习成功的通用机制。

Towards a universal mechanism for successful deep learning.

作者信息

Meir Yuval, Tzach Yarden, Hodassman Shiri, Tevet Ofek, Kanter Ido

机构信息

Department of Physics, Bar-Ilan University, 52900, Ramat-Gan, Israel.

Gonda Interdisciplinary Brain Research Center, Bar-Ilan University, 52900, Ramat-Gan, Israel.

出版信息

Sci Rep. 2024 Mar 11;14(1):5881. doi: 10.1038/s41598-024-56609-x.

DOI:10.1038/s41598-024-56609-x
PMID:38467786
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10928127/
Abstract

Recently, the underlying mechanism for successful deep learning (DL) was presented based on a quantitative method that measures the quality of a single filter in each layer of a DL model, particularly VGG-16 trained on CIFAR-10. This method exemplifies that each filter identifies small clusters of possible output labels, with additional noise selected as labels outside the clusters. This feature is progressively sharpened with each layer, resulting in an enhanced signal-to-noise ratio (SNR), which leads to an increase in the accuracy of the DL network. In this study, this mechanism is verified for VGG-16 and EfficientNet-B0 trained on the CIFAR-100 and ImageNet datasets, and the main results are as follows. First, the accuracy and SNR progressively increase with the layers. Second, for a given deep architecture, the maximal error rate increases approximately linearly with the number of output labels. Third, similar trends were obtained for dataset labels in the range [3, 1000], thus supporting the universality of this mechanism. Understanding the performance of a single filter and its dominating features paves the way to highly dilute the deep architecture without affecting its overall accuracy, and this can be achieved by applying the filter's cluster connections (AFCC).

摘要

最近,基于一种定量方法提出了深度学习(DL)成功的潜在机制,该方法用于衡量DL模型各层中单个滤波器的质量,特别是在CIFAR-10数据集上训练的VGG-16模型。该方法表明,每个滤波器识别可能输出标签的小集群,额外的噪声被选为集群外的标签。随着层数的增加,这一特征逐渐变得更加明显,从而提高了信噪比(SNR),进而提高了DL网络的准确性。在本研究中,对在CIFAR-100和ImageNet数据集上训练的VGG-16和EfficientNet-B0验证了这一机制,主要结果如下。第一,准确率和信噪比随着层数的增加而逐渐提高。第二,对于给定的深度架构,最大错误率随输出标签数量近似线性增加。第三,在[3, 1000]范围内的数据集标签也获得了类似的趋势,从而支持了该机制的普遍性。了解单个滤波器的性能及其主导特征为在不影响整体准确性的情况下大幅简化深度架构铺平了道路,这可以通过应用滤波器的聚类连接(AFCC)来实现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b39c/10928127/0c9a5499f0fb/41598_2024_56609_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b39c/10928127/a2b12c08c00d/41598_2024_56609_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b39c/10928127/fd916c64dd5a/41598_2024_56609_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b39c/10928127/2873897d5029/41598_2024_56609_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b39c/10928127/c7d168a32aeb/41598_2024_56609_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b39c/10928127/abbc2d362696/41598_2024_56609_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b39c/10928127/6c571318b41b/41598_2024_56609_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b39c/10928127/0c9a5499f0fb/41598_2024_56609_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b39c/10928127/a2b12c08c00d/41598_2024_56609_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b39c/10928127/fd916c64dd5a/41598_2024_56609_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b39c/10928127/2873897d5029/41598_2024_56609_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b39c/10928127/c7d168a32aeb/41598_2024_56609_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b39c/10928127/abbc2d362696/41598_2024_56609_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b39c/10928127/6c571318b41b/41598_2024_56609_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b39c/10928127/0c9a5499f0fb/41598_2024_56609_Fig7_HTML.jpg

相似文献

1
Towards a universal mechanism for successful deep learning.迈向深度学习成功的通用机制。
Sci Rep. 2024 Mar 11;14(1):5881. doi: 10.1038/s41598-024-56609-x.
2
HRel: Filter pruning based on High Relevance between activation maps and class labels.HRel:基于激活图与类别标签之间的高相关性的滤波器修剪。
Neural Netw. 2022 Mar;147:186-197. doi: 10.1016/j.neunet.2021.12.017. Epub 2021 Dec 30.
3
Deep Learning Model Compression With Rank Reduction in Tensor Decomposition.张量分解中基于秩降低的深度学习模型压缩
IEEE Trans Neural Netw Learn Syst. 2025 Jan;36(1):1315-1328. doi: 10.1109/TNNLS.2023.3330542. Epub 2025 Jan 7.
4
An investigation of quantitative accuracy for deep learning based denoising in oncological PET.基于深度学习的肿瘤 PET 去噪定量准确性研究。
Phys Med Biol. 2019 Aug 21;64(16):165019. doi: 10.1088/1361-6560/ab3242.
5
The Role of Knowledge Creation-Oriented Convolutional Neural Network in Learning Interaction.面向知识创造的卷积神经网络在学习交互中的作用。
Comput Intell Neurosci. 2022 Mar 16;2022:6493311. doi: 10.1155/2022/6493311. eCollection 2022.
6
Deep convolutional neural network and IoT technology for healthcare.用于医疗保健的深度卷积神经网络和物联网技术。
Digit Health. 2024 Jan 17;10:20552076231220123. doi: 10.1177/20552076231220123. eCollection 2024 Jan-Dec.
7
Detecting cerebral microbleeds via deep learning with features enhancement by reusing ground truth.通过深度学习并借助复用真值来增强特征以检测脑微出血。
Comput Methods Programs Biomed. 2021 Jun;204:106051. doi: 10.1016/j.cmpb.2021.106051. Epub 2021 Mar 12.
8
PCA driven mixed filter pruning for efficient convNets.基于 PCA 的混合滤波器剪枝算法在高效卷积神经网络中的应用。
PLoS One. 2022 Jan 24;17(1):e0262386. doi: 10.1371/journal.pone.0262386. eCollection 2022.
9
Efficient shallow learning as an alternative to deep learning.高效浅层学习作为深度学习的替代方法。
Sci Rep. 2023 Apr 20;13(1):5423. doi: 10.1038/s41598-023-32559-8.
10
Noise reduction with cross-tracer and cross-protocol deep transfer learning for low-dose PET.基于跨示踪剂和跨协议深度迁移学习的低剂量 PET 降噪。
Phys Med Biol. 2020 Sep 14;65(18):185006. doi: 10.1088/1361-6560/abae08.

引用本文的文献

1
A Transfer Learning-Based VGG-16 Model for COD Detection in UV-Vis Spectroscopy.一种基于迁移学习的用于紫外-可见光谱中化学需氧量检测的VGG-16模型。
J Imaging. 2025 May 17;11(5):159. doi: 10.3390/jimaging11050159.

本文引用的文献

1
Enhancing the accuracies by performing pooling decisions adjacent to the output layer.通过在输出层附近执行池化决策来提高准确率。
Sci Rep. 2023 Aug 31;13(1):13385. doi: 10.1038/s41598-023-40566-y.
2
A deep learning framework for neuroscience.深度学习在神经科学中的应用框架。
Nat Neurosci. 2019 Nov;22(11):1761-1770. doi: 10.1038/s41593-019-0520-2. Epub 2019 Oct 28.
3
Deep learning.深度学习。
Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.
4
Deep learning in neural networks: an overview.神经网络中的深度学习:综述。
Neural Netw. 2015 Jan;61:85-117. doi: 10.1016/j.neunet.2014.09.003. Epub 2014 Oct 13.