
Understanding the Distributions of Aggregation Layers in Deep Neural Networks.

Authors

Ong Eng-Jon, Husain Sameed, Bober Miroslaw

Publication

IEEE Trans Neural Netw Learn Syst. 2024 Apr;35(4):5536-5550. doi: 10.1109/TNNLS.2022.3207790. Epub 2024 Apr 4.

Abstract

The process of aggregation is ubiquitous in almost all deep network models. It serves as an important mechanism for consolidating deep features into a more compact representation, while increasing robustness to overfitting and providing spatial invariance. In particular, the proximity of global aggregation layers to the output layers of a DNN means that aggregated features directly influence the network's performance. A better understanding of this relationship can be obtained using information-theoretic methods; however, this requires knowledge of the distributions of the activations of aggregation layers. To achieve this, we propose a novel mathematical formulation for analytically modeling the probability distributions of the output values of layers involved in deep feature aggregation. An important outcome is our ability to analytically predict the Kullback-Leibler (KL) divergence of output nodes in a DNN. We also experimentally verify our theoretical predictions against empirical observations across a broad range of classification tasks and datasets.
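The abstract's setting can be illustrated with a minimal sketch: a global average pooling layer aggregates a spatial feature map into a compact vector, and the KL divergence compares the resulting output distribution with a reference. The shapes, function names, and the uniform reference distribution below are illustrative assumptions, not the paper's actual model.

```python
import numpy as np

def global_average_pool(feature_map):
    """Aggregate an (H, W, C) feature map to a C-dim vector by spatial averaging."""
    return feature_map.mean(axis=(0, 1))

def softmax(z):
    """Numerically stable softmax over a 1-D array."""
    e = np.exp(z - z.max())
    return e / e.sum()

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for discrete distributions p, q (clipped to avoid log(0))."""
    p = np.clip(p, eps, 1.0)
    q = np.clip(q, eps, 1.0)
    return float(np.sum(p * np.log(p / q)))

# Illustrative example: a 7x7 feature map with 8 channels (assumed sizes).
rng = np.random.default_rng(0)
fmap = rng.standard_normal((7, 7, 8))
pooled = global_average_pool(fmap)        # compact (8,) representation
p = softmax(pooled)                       # output-node distribution
q = np.full_like(p, 1.0 / p.size)         # uniform reference distribution
print(pooled.shape, kl_divergence(p, q))
```

KL divergence is non-negative and zero only when the two distributions coincide, which is why predicting it analytically (as the paper does) characterizes how far an output node's distribution departs from a reference.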

