• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

文字之窗:利用词嵌入探索卷积神经网络的习得表示。

Words as a window: Using word embeddings to explore the learned representations of Convolutional Neural Networks.

机构信息

University of Victoria, Department of Computer Science, 3800 Finnerty Road, Victoria, British Columbia, Canada.

University of Alberta, Department of Computing Science & Department of Psychology, 116 St. and 85 Ave., Edmonton, Alberta, Canada.

出版信息

Neural Netw. 2021 May;137:63-74. doi: 10.1016/j.neunet.2020.12.009. Epub 2021 Jan 22.

DOI:10.1016/j.neunet.2020.12.009
PMID:33556802
Abstract

As deep neural net architectures minimize loss, they accumulate information in a hierarchy of learned representations that ultimately serve the network's final goal. Different architectures tackle this problem in slightly different ways, but all create intermediate representational spaces built to inform their final prediction. Here we show that very different neural networks trained on two very different tasks build knowledge representations that display similar underlying patterns. Namely, we show that the representational spaces of several distributional semantic models bear a remarkable resemblance to several Convolutional Neural Network (CNN) architectures (trained for image classification). We use this information to explore the network behavior of CNNs (1) in pretrained models, (2) during training, and (3) during adversarial attacks. We use these findings to motivate several applications aimed at improving future research on CNNs. Our work illustrates the power of using one model to explore another, gives new insights into the function of CNN models, and provides a framework for others to perform similar analyses when developing new architectures. We show that one neural network model can provide a window into understanding another.

摘要

随着深度神经网络架构最小化损失,它们会在学习的表示层次结构中积累信息,这些信息最终将服务于网络的最终目标。不同的架构以略有不同的方式解决这个问题,但都创建了中间表示空间,旨在为最终预测提供信息。在这里,我们表明,在两个非常不同的任务上训练的非常不同的神经网络会构建显示出相似潜在模式的知识表示。也就是说,我们表明,几个分布语义模型的表示空间与几个卷积神经网络(CNN)架构(用于图像分类训练)非常相似。我们利用这些信息来探索 CNN 的网络行为(1)在预训练模型中,(2)在训练期间,以及(3)在对抗攻击期间。我们利用这些发现来激发几项旨在改进未来 CNN 研究的应用。我们的工作说明了使用一个模型来探索另一个模型的强大功能,为 CNN 模型的功能提供了新的见解,并为其他人在开发新架构时进行类似分析提供了框架。我们表明,一个神经网络模型可以提供理解另一个模型的窗口。

相似文献

1
Words as a window: Using word embeddings to explore the learned representations of Convolutional Neural Networks.文字之窗:利用词嵌入探索卷积神经网络的习得表示。
Neural Netw. 2021 May;137:63-74. doi: 10.1016/j.neunet.2020.12.009. Epub 2021 Jan 22.
2
Exploring Deep Learning and Transfer Learning for Colonic Polyp Classification.探索深度学习和迁移学习用于结肠息肉分类
Comput Math Methods Med. 2016;2016:6584725. doi: 10.1155/2016/6584725. Epub 2016 Oct 26.
3
Automatic extraction of cancer registry reportable information from free-text pathology reports using multitask convolutional neural networks.使用多任务卷积神经网络从自由文本病理报告中自动提取癌症登记报告信息。
J Am Med Inform Assoc. 2020 Jan 1;27(1):89-98. doi: 10.1093/jamia/ocz153.
4
CiwGAN and fiwGAN: Encoding information in acoustic data to model lexical learning with Generative Adversarial Networks.CiwGAN 和 fiwGAN:利用生成对抗网络将声学数据中的信息编码,以建模词汇学习。
Neural Netw. 2021 Jul;139:305-325. doi: 10.1016/j.neunet.2021.03.017. Epub 2021 Mar 19.
5
Study of the Application of Deep Convolutional Neural Networks (CNNs) in Processing Sensor Data and Biomedical Images.深度学习卷积神经网络(CNNs)在传感器数据和生物医学图像处理中的应用研究。
Sensors (Basel). 2019 Aug 17;19(16):3584. doi: 10.3390/s19163584.
6
Biomedical Text Classification Using Augmented Word Representation Based on Distributional and Relational Contexts.基于分布和关系上下文的增强词表示法进行生物医学文本分类
Comput Intell Neurosci. 2023 Feb 15;2023:2989791. doi: 10.1155/2023/2989791. eCollection 2023.
7
Comparison of Word and Character Level Information for Medical Term Identification Using Convolutional Neural Networks and Transformers.使用卷积神经网络和Transformer进行医学术语识别时词级和字符级信息的比较
Stud Health Technol Inform. 2021 Dec 15;284:249-253. doi: 10.3233/SHTI210717.
8
Biomedical literature classification with a CNNs-based hybrid learning network.基于 CNNs 的混合学习网络的生物医学文献分类。
PLoS One. 2018 Jul 26;13(7):e0197933. doi: 10.1371/journal.pone.0197933. eCollection 2018.
9
Understanding the role of individual units in a deep neural network.理解深度神经网络中单个单元的作用。
Proc Natl Acad Sci U S A. 2020 Dec 1;117(48):30071-30078. doi: 10.1073/pnas.1907375117. Epub 2020 Sep 1.
10
A comparison of word embeddings for the biomedical natural language processing.生物医学自然语言处理中词嵌入的比较。
J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.

引用本文的文献

1
Computational reconstruction of mental representations using human behavior.使用人类行为进行心理表象的计算重建。
Nat Commun. 2024 May 17;15(1):4183. doi: 10.1038/s41467-024-48114-6.
2
Impact of word embedding models on text analytics in deep learning environment: a review.词嵌入模型对深度学习环境下文本分析的影响:综述
Artif Intell Rev. 2023 Feb 22:1-81. doi: 10.1007/s10462-023-10419-1.