• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

整合经验数据和分布数据以学习语义表示。

Integrating experiential and distributional data to learn semantic representations.

作者信息

Andrews Mark, Vigliocco Gabriella, Vinson David

机构信息

Cognitive, Perceptual and Brain Sciences, University College London, London, UK.

出版信息

Psychol Rev. 2009 Jul;116(3):463-98. doi: 10.1037/a0016261.

DOI:10.1037/a0016261
PMID:19618982
Abstract

The authors identify 2 major types of statistical data from which semantic representations can be learned. These are denoted as experiential data and distributional data. Experiential data are derived by way of experience with the physical world and comprise the sensory-motor data obtained through sense receptors. Distributional data, by contrast, describe the statistical distribution of words across spoken and written language. The authors claim that experiential and distributional data represent distinct data types and that each is a nontrivial source of semantic information. Their theoretical proposal is that human semantic representations are derived from an optimal statistical combination of these 2 data types. Using a Bayesian probabilistic model, they demonstrate how word meanings can be learned by treating experiential and distributional data as a single joint distribution and learning the statistical structure that underlies it. The semantic representations that are learned in this manner are measurably more realistic-as verified by comparison to a set of human-based measures of semantic representation-than those available from either data type individually or from both sources independently. This is not a result of merely using quantitatively more data, but rather it is because experiential and distributional data are qualitatively distinct, yet intercorrelated, types of data. The semantic representations that are learned are based on statistical structures that exist both within and between the experiential and distributional data types.

摘要

作者识别出两类主要的统计数据,从中可以学习语义表征。这些数据被称为经验数据和分布数据。经验数据是通过与物理世界的交互获得的,包括通过感官受体获取的感觉运动数据。相比之下,分布数据描述了单词在口语和书面语中的统计分布。作者声称,经验数据和分布数据代表了不同的数据类型,且每一种都是语义信息的重要来源。他们的理论主张是,人类语义表征源自这两种数据类型的最优统计组合。通过使用贝叶斯概率模型,他们展示了如何将经验数据和分布数据视为一个联合分布,并学习其背后的统计结构,从而习得单词的含义。通过与一组基于人类的语义表征度量标准进行比较验证,以这种方式习得的语义表征明显更贴近现实,比单独从任何一种数据类型或从这两种数据类型独立得出的表征都更具现实性。这并非仅仅是使用了数量更多的数据所致,而是因为经验数据和分布数据在性质上截然不同,但又相互关联。所习得的语义表征基于经验数据和分布数据类型内部及之间存在的统计结构。

相似文献

1
Integrating experiential and distributional data to learn semantic representations.整合经验数据和分布数据以学习语义表示。
Psychol Rev. 2009 Jul;116(3):463-98. doi: 10.1037/a0016261.
2
The primacy of experience in language processing: Semantic priming is driven primarily by experiential similarity.经验在语言处理中的首要地位:语义启动主要是由经验相似性驱动的。
Neuropsychologia. 2024 Aug 13;201:108939. doi: 10.1016/j.neuropsychologia.2024.108939. Epub 2024 Jun 18.
3
Redundancy in perceptual and linguistic experience: comparing feature-based and distributional models of semantic representation.感知与语言经验中的冗余:比较基于特征和分布的语义表征模型
Top Cogn Sci. 2011 Apr;3(2):303-45. doi: 10.1111/j.1756-8765.2010.01111.x. Epub 2010 Aug 19.
4
The Primacy of Experience in Language Processing: Semantic Priming Is Driven Primarily by Experiential Similarity.语言处理中经验的首要性:语义启动主要由经验相似性驱动。
bioRxiv. 2023 Dec 18:2023.03.21.533703. doi: 10.1101/2023.03.21.533703.
5
The hidden Markov Topic model: a probabilistic model of semantic representation.隐马尔可夫主题模型:一种语义表征的概率模型。
Top Cogn Sci. 2010 Jan;2(1):101-13. doi: 10.1111/j.1756-8765.2009.01074.x.
6
Determining the Relativity of Word Meanings Through the Construction of Individualized Models of Semantic Memory.通过构建语义记忆的个体化模型来确定词义的相对性。
Cogn Sci. 2024 Feb;48(2):e13413. doi: 10.1111/cogs.13413.
7
Agraphia失写症
8
Distinct fronto-temporal substrates of distributional and taxonomic similarity among words: evidence from RSA of BOLD signals.词的分布相似性和分类相似性的额颞叶不同基础:基于 BOLD 信号的 RSA 证据。
Neuroimage. 2021 Jan 1;224:117408. doi: 10.1016/j.neuroimage.2020.117408. Epub 2020 Oct 10.
9
Probing Lexical Ambiguity: Word Vectors Encode Number and Relatedness of Senses.探究词汇歧义:词向量编码词义的数量和关联性。
Cogn Sci. 2021 May;45(5):e12943. doi: 10.1111/cogs.12943.
10
Can prediction-based distributional semantic models predict typicality?基于预测的分布语义模型能预测典型性吗?
Q J Exp Psychol (Hove). 2019 Aug;72(8):2084-2109. doi: 10.1177/1747021819830949. Epub 2019 Feb 21.

引用本文的文献

1
Mapping distinct neural pathways associated with conceptual and lexical verb retrieval priming in aphasia: A preliminary study.绘制失语症中与概念性和词汇性动词检索启动相关的不同神经通路:一项初步研究。
Neuroimage Clin. 2025 Aug 13;48:103867. doi: 10.1016/j.nicl.2025.103867.
2
Leveraging Context for Perceptual Prediction Using Word Embeddings.利用词嵌入的上下文进行感知预测。
Cogn Sci. 2025 Jun;49(6):e70072. doi: 10.1111/cogs.70072.
3
Learning about color from language.从语言中学习颜色。
Commun Psychol. 2025 Apr 14;3(1):60. doi: 10.1038/s44271-025-00230-9.
4
Development of Artificial Intelligence Systems for Chronic Kidney Disease.慢性肾脏病人工智能系统的开发
JMA J. 2025 Jan 15;8(1):48-56. doi: 10.31662/jmaj.2024-0090. Epub 2024 Sep 6.
5
Visual experience modulates the sensitivity to the distributional history of words in natural language.视觉体验会调节对自然语言中词汇分布历史的敏感度。
Psychon Bull Rev. 2025 Feb;32(1):472-481. doi: 10.3758/s13423-024-02557-6. Epub 2024 Aug 22.
6
Symbol ungrounding: what the successes (and failures) of large language models reveal about human cognition.符号去 grounding:大型语言模型的成功(和失败)揭示了人类认知的什么。
Philos Trans R Soc Lond B Biol Sci. 2024 Oct 7;379(1911):20230149. doi: 10.1098/rstb.2023.0149. Epub 2024 Aug 19.
7
Invariant representations in abstract concept grounding - the physical world in grounded cognition.抽象概念基础中的不变表示——具身认知中的物理世界。
Psychon Bull Rev. 2024 Dec;31(6):2558-2580. doi: 10.3758/s13423-024-02522-3. Epub 2024 May 28.
8
Mathematical problem solving is modulated by word priming.数学问题解决受到词汇启动的调节。
Psych J. 2024 Jun;13(3):465-476. doi: 10.1002/pchj.732. Epub 2024 Feb 1.
9
New marker for chronic kidney disease progression and mortality in medical-word virtual space.医学字空间中慢性肾脏病进展和死亡的新标志物。
Sci Rep. 2024 Jan 18;14(1):1661. doi: 10.1038/s41598-024-52235-9.
10
Feats: A database of semantic features for early produced noun concepts.特点:早期生成名词概念的语义特征数据库。
Behav Res Methods. 2024 Apr;56(4):3259-3279. doi: 10.3758/s13428-023-02242-x. Epub 2023 Dec 26.