Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139.
Proc Natl Acad Sci U S A. 2020 Dec 1;117(48):30071-30078. doi: 10.1073/pnas.1907375117. Epub 2020 Sep 1.
Deep neural networks excel at finding hierarchical representations that solve complex tasks over large datasets. How can we humans understand these learned representations? In this work, we present network dissection, an analytic framework to systematically identify the semantics of individual hidden units within image classification and image generation networks. First, we analyze a convolutional neural network (CNN) trained on scene classification and discover units that match a diverse set of object concepts. We find evidence that the network has learned many object classes that play crucial roles in classifying scene classes. Second, we use a similar analytic method to analyze a generative adversarial network (GAN) model trained to generate scenes. By analyzing changes made when small sets of units are activated or deactivated, we find that objects can be added and removed from the output scenes while adapting to the context. Finally, we apply our analytic framework to understanding adversarial attacks and to semantic image editing.
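To make the unit-labeling step concrete, the sketch below scores one convolutional unit against one visual concept by computing the IoU between the unit's thresholded, upsampled activation map and a binary segmentation mask for that concept. It is a minimal PyTorch illustration under assumed inputs (acts, masks, and the quantile threshold are placeholders), not the authors' reference implementation:

# Minimal sketch (assumed names and data) of scoring one CNN unit against one concept.
# acts  -- (N, H, W) activations of a single convolutional unit over N images
# masks -- (N, H', W') binary maps marking where the concept appears in each image
import torch
import torch.nn.functional as F

def score_unit(acts: torch.Tensor, masks: torch.Tensor, quantile: float = 0.99) -> float:
    """IoU between the unit's thresholded activation map and the concept mask."""
    # Upsample the low-resolution activations to the mask resolution.
    up = F.interpolate(acts.unsqueeze(1), size=masks.shape[-2:],
                       mode="bilinear", align_corners=False).squeeze(1)
    # Threshold at a high quantile of the unit's own activation distribution.
    t = torch.quantile(up.flatten(), quantile)
    fired = up > t
    concept = masks.bool()
    intersection = (fired & concept).sum().float()
    union = (fired | concept).sum().float()
    return (intersection / union.clamp(min=1.0)).item()

# A unit is then labeled with whichever concept yields the highest IoU over the dataset.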
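The GAN analysis can be pictured the same way: intervene on a small set of units at one generator layer and compare the generated scenes before and after. The sketch below uses a standard PyTorch forward hook to clamp selected feature-map channels; the generator, layer, and unit indices are hypothetical placeholders rather than the model studied in the paper:

import torch

def intervene_on_units(layer: torch.nn.Module, unit_ids, value: float = 0.0):
    """Clamp selected channels of `layer`'s output to `value` (0 = ablate)."""
    def hook(module, inputs, output):
        output = output.clone()
        output[:, unit_ids] = value   # deactivate (or force on) the chosen units
        return output                 # returning a tensor replaces the layer's output
    return layer.register_forward_hook(hook)

# Hypothetical usage: zeroing units that match "tree" removes trees from generated
# scenes, while a large positive value tends to insert the object where context allows.
# handle = intervene_on_units(G.layer4, unit_ids=[12, 57, 301], value=0.0)
# imgs = G(z)        # generate with the intervention active
# handle.remove()    # restore normal behavior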