Words as a window: Using word embeddings to explore the learned representations of Convolutional Neural Networks.

Affiliations

University of Victoria, Department of Computer Science, 3800 Finnerty Road, Victoria, British Columbia, Canada.

University of Alberta, Department of Computing Science & Department of Psychology, 116 St. and 85 Ave., Edmonton, Alberta, Canada.

Publication information

Neural Netw. 2021 May;137:63-74. doi: 10.1016/j.neunet.2020.12.009. Epub 2021 Jan 22.

DOI:10.1016/j.neunet.2020.12.009

PMID:33556802

Abstract

As deep neural net architectures minimize loss, they accumulate information in a hierarchy of learned representations that ultimately serve the network's final goal. Different architectures tackle this problem in slightly different ways, but all create intermediate representational spaces built to inform their final prediction. Here we show that very different neural networks trained on two very different tasks build knowledge representations that display similar underlying patterns. Namely, we show that the representational spaces of several distributional semantic models bear a remarkable resemblance to several Convolutional Neural Network (CNN) architectures (trained for image classification). We use this information to explore the network behavior of CNNs (1) in pretrained models, (2) during training, and (3) during adversarial attacks. We use these findings to motivate several applications aimed at improving future research on CNNs. Our work illustrates the power of using one model to explore another, gives new insights into the function of CNN models, and provides a framework for others to perform similar analyses when developing new architectures. We show that one neural network model can provide a window into understanding another.
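The abstract does not say how the two kinds of representational space are compared. One common way to compare spaces of different dimensionality is to compare their pairwise-similarity structure over the same set of items. The sketch below illustrates that idea on synthetic data only; it is an assumption for illustration, not the authors' method. All function names and the toy vectors (stand-ins for word embeddings and CNN class representations) are hypothetical.

```python
import numpy as np


def cosine_similarity_matrix(X):
    """Pairwise cosine similarity between the rows of X."""
    X = np.asarray(X, dtype=float)
    unit = X / np.linalg.norm(X, axis=1, keepdims=True)
    return unit @ unit.T


def upper_triangle(M):
    """Off-diagonal upper-triangle entries of a square matrix, as a 1-D array."""
    i, j = np.triu_indices(M.shape[0], k=1)
    return M[i, j]


def representational_similarity(A, B):
    """Pearson correlation between the pairwise-similarity structures of A and B.

    Row k of A and row k of B must describe the same item (e.g. the same
    class label); the two spaces may have different dimensionality.
    """
    A, B = np.asarray(A, dtype=float), np.asarray(B, dtype=float)
    if A.shape[0] != B.shape[0]:
        raise ValueError("A and B must describe the same number of items")
    a = upper_triangle(cosine_similarity_matrix(A))
    b = upper_triangle(cosine_similarity_matrix(B))
    return float(np.corrcoef(a, b)[0, 1])


def make_toy_spaces(seed=0, n_items=20, dim_a=50, dim_b=64):
    """Synthetic data: a space, a rotated copy in more dimensions, and noise."""
    rng = np.random.default_rng(seed)
    space_a = rng.normal(size=(n_items, dim_a))       # stand-in for word embeddings
    padded = np.hstack([space_a, np.zeros((n_items, dim_b - dim_a))])
    Q, _ = np.linalg.qr(rng.normal(size=(dim_b, dim_b)))  # random orthogonal matrix
    space_b = padded @ Q                              # same geometry, different axes
    unrelated = rng.normal(size=(n_items, dim_b))     # no shared structure
    return space_a, space_b, unrelated


if __name__ == "__main__":
    space_a, space_b, unrelated = make_toy_spaces()
    print("matched spaces:  ", representational_similarity(space_a, space_b))
    print("unrelated spaces:", representational_similarity(space_a, unrelated))
```

Because the second toy space is a zero-padded rotation of the first, its cosine similarities are preserved and the score is close to 1, while the unrelated space scores much lower. With real models, the rows would instead be per-class vectors drawn from each model.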

Similar articles

Exploring Deep Learning and Transfer Learning for Colonic Polyp Classification.

Comput Math Methods Med. 2016;2016:6584725. doi: 10.1155/2016/6584725. Epub 2016 Oct 26.

Automatic extraction of cancer registry reportable information from free-text pathology reports using multitask convolutional neural networks.

J Am Med Inform Assoc. 2020 Jan 1;27(1):89-98. doi: 10.1093/jamia/ocz153.

CiwGAN and fiwGAN: Encoding information in acoustic data to model lexical learning with Generative Adversarial Networks.

Neural Netw. 2021 Jul;139:305-325. doi: 10.1016/j.neunet.2021.03.017. Epub 2021 Mar 19.

Study of the Application of Deep Convolutional Neural Networks (CNNs) in Processing Sensor Data and Biomedical Images.

Sensors (Basel). 2019 Aug 17;19(16):3584. doi: 10.3390/s19163584.

Biomedical Text Classification Using Augmented Word Representation Based on Distributional and Relational Contexts.

Comput Intell Neurosci. 2023 Feb 15;2023:2989791. doi: 10.1155/2023/2989791. eCollection 2023.

Comparison of Word and Character Level Information for Medical Term Identification Using Convolutional Neural Networks and Transformers.

Stud Health Technol Inform. 2021 Dec 15;284:249-253. doi: 10.3233/SHTI210717.

Biomedical literature classification with a CNNs-based hybrid learning network.

PLoS One. 2018 Jul 26;13(7):e0197933. doi: 10.1371/journal.pone.0197933. eCollection 2018.

Understanding the role of individual units in a deep neural network.

Proc Natl Acad Sci U S A. 2020 Dec 1;117(48):30071-30078. doi: 10.1073/pnas.1907375117. Epub 2020 Sep 1.

A comparison of word embeddings for the biomedical natural language processing.

J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.

Cited by

Computational reconstruction of mental representations using human behavior.

Nat Commun. 2024 May 17;15(1):4183. doi: 10.1038/s41467-024-48114-6.

Impact of word embedding models on text analytics in deep learning environment: a review.

Artif Intell Rev. 2023 Feb 22:1-81. doi: 10.1007/s10462-023-10419-1.
