• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

挖掘众包词典以理解词义的一致性和偏好

Mining a Crowdsourced Dictionary to Understand Consistency and Preference in Word Meanings.

作者信息

Johns Brendan T

机构信息

Department of Communicative Disorders and Sciences, University at Buffalo, Buffalo, NY, United States.

出版信息

Front Psychol. 2019 Feb 18;10:268. doi: 10.3389/fpsyg.2019.00268. eCollection 2019.

DOI:10.3389/fpsyg.2019.00268
PMID:30833917
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6387934/
Abstract

Big data approaches to psychology have become increasing popular (Jones, 2017). Two of the main developments of this line of research is the advent of distributional models of semantics (e.g., Landauer and Dumais, 1997), which learn the meaning of words from large text corpora, and the collection of mega datasets of human behavior (e.g., The English lexicon project; Balota et al., 2007). The current article combines these two approaches, with the goal being to understand the consistency and preference that people have for word meanings. This was accomplished by mining a large amount of data from an online, crowdsourced dictionary and analyzing this data with a distributional model. Overall, it was found that even for words that are not an active part of the language environment, there is a large amount of consistency in the word meanings that different people have. Additionally, it was demonstrated that users of a language have strong preferences for word meanings, such that definitions to words that do not conform to people's conceptions are rejected by a community of language users. The results of this article provides insights into the cultural evolution of word meanings, and sheds light on alternative methodologies that can be used to understand lexical behavior.

摘要

心理学中的大数据方法越来越受欢迎(琼斯,2017)。这一研究方向的两个主要进展是语义分布模型的出现(例如,兰道尔和杜梅斯,1997),该模型从大型文本语料库中学习单词的含义,以及人类行为海量数据集的收集(例如,英语词汇项目;巴洛塔等人,2007)。本文结合了这两种方法,目标是了解人们对词义的一致性和偏好。这是通过从一个在线众包词典中挖掘大量数据并用分布模型分析这些数据来实现的。总体而言,研究发现,即使对于那些并非语言环境中常用的单词,不同人所理解的词义也存在大量一致性。此外,研究表明,一种语言的使用者对词义有强烈的偏好,以至于不符合人们概念的单词定义会被语言使用者群体拒绝。本文的研究结果为词义的文化演变提供了见解,并揭示了可用于理解词汇行为的其他方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fe/6387934/9925976b4549/fpsyg-10-00268-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fe/6387934/dd1d944698f4/fpsyg-10-00268-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fe/6387934/e85fbdf9a8fb/fpsyg-10-00268-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fe/6387934/872fd8fdb090/fpsyg-10-00268-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fe/6387934/2bec20b9b3c8/fpsyg-10-00268-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fe/6387934/db34e750f997/fpsyg-10-00268-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fe/6387934/83ce52fde1a1/fpsyg-10-00268-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fe/6387934/9925976b4549/fpsyg-10-00268-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fe/6387934/dd1d944698f4/fpsyg-10-00268-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fe/6387934/e85fbdf9a8fb/fpsyg-10-00268-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fe/6387934/872fd8fdb090/fpsyg-10-00268-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fe/6387934/2bec20b9b3c8/fpsyg-10-00268-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fe/6387934/db34e750f997/fpsyg-10-00268-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fe/6387934/83ce52fde1a1/fpsyg-10-00268-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40fe/6387934/9925976b4549/fpsyg-10-00268-g007.jpg

相似文献

1
Mining a Crowdsourced Dictionary to Understand Consistency and Preference in Word Meanings.挖掘众包词典以理解词义的一致性和偏好
Front Psychol. 2019 Feb 18;10:268. doi: 10.3389/fpsyg.2019.00268. eCollection 2019.
2
Determining the Relativity of Word Meanings Through the Construction of Individualized Models of Semantic Memory.通过构建语义记忆的个体化模型来确定词义的相对性。
Cogn Sci. 2024 Feb;48(2):e13413. doi: 10.1111/cogs.13413.
3
Distributional social semantics: Inferring word meanings from communication patterns.分布社会语义学:从交流模式中推断词义。
Cogn Psychol. 2021 Dec;131:101441. doi: 10.1016/j.cogpsych.2021.101441. Epub 2021 Oct 16.
4
From sound to meaning: Phonology-to-Semantics mapping in visual word recognition.从声音到意义:视觉单词识别中的音系到语义映射。
Psychon Bull Rev. 2017 Jun;24(3):887-893. doi: 10.3758/s13423-016-1152-0.
5
A database of orthography-semantics consistency (OSC) estimates for 15,017 English words.一个包含 15017 个英语单词的正字法-语义一致性(OSC)估计值数据库。
Behav Res Methods. 2018 Aug;50(4):1482-1495. doi: 10.3758/s13428-018-1017-8.
6
A Large-Scale Analysis of Variance in Written Language.书面语言的大规模方差分析
Cogn Sci. 2018 May;42(4):1360-1374. doi: 10.1111/cogs.12583. Epub 2018 Jan 22.
7
Word Senses as Clusters of Meaning Modulations: A Computational Model of Polysemy.词义作为意义调制的聚类:一词多义的计算模型。
Cogn Sci. 2021 Apr;45(4):e12955. doi: 10.1111/cogs.12955.
8
Probing Lexical Ambiguity: Word Vectors Encode Number and Relatedness of Senses.探究词汇歧义:词向量编码词义的数量和关联性。
Cogn Sci. 2021 May;45(5):e12943. doi: 10.1111/cogs.12943.
9
A database of 629 English compound words: ratings of familiarity, lexeme meaning dominance, semantic transparency, age of acquisition, imageability, and sensory experience.629 个英语复合词数据库:熟悉度评分、词元意义主导性、语义透明度、习得年龄、形象性和感官体验。
Behav Res Methods. 2015 Dec;47(4):1004-1019. doi: 10.3758/s13428-014-0523-6.
10
Estimating the average need of semantic knowledge from distributional semantic models.从分布语义模型估计语义知识的平均需求。
Mem Cognit. 2017 Nov;45(8):1350-1370. doi: 10.3758/s13421-017-0732-1.

引用本文的文献

1
Apriori Algorithm for the Data Mining of Global Cyberspace Security Issues for Human Participatory Based on Association Rules.基于关联规则的人类参与式全球网络空间安全问题数据挖掘的Apriori算法
Front Psychol. 2021 Feb 9;11:582480. doi: 10.3389/fpsyg.2020.582480. eCollection 2020.
2
Commentary: Principles, Approaches and Challenges of Applying Big Data in Safety Psychology Research.评论:大数据在安全心理学研究中的应用原则、方法及挑战
Front Psychol. 2019 Dec 10;10:2801. doi: 10.3389/fpsyg.2019.02801. eCollection 2019.

本文引用的文献

1
A Large-Scale Semantic Analysis of Verbal Fluency Across the Aging Spectrum: Data From the Canadian Longitudinal Study on Aging.在老龄化光谱中进行言语流畅性的大规模语义分析:来自加拿大老龄化纵向研究的数据。
J Gerontol B Psychol Sci Soc Sci. 2020 Oct 16;75(9):e221-e230. doi: 10.1093/geronb/gbz003.
2
Examining word processing via a megastudy of conditional reading aloud.通过条件朗读的大规模研究来审视文字处理。
Q J Exp Psychol (Hove). 2018 Nov;71(11):2295-2313. doi: 10.1177/1747021817741269. Epub 2018 Jan 1.
3
Conceptualizing syntactic categories as semantic categories: Unifying part-of-speech identification and semantics using co-occurrence vector averaging.
将句法范畴概念化为语义范畴:使用共现向量平均统一词性识别和语义。
Behav Res Methods. 2019 Jun;51(3):1371-1398. doi: 10.3758/s13428-018-1118-4.
4
Using experiential optimization to build lexical representations.利用经验优化构建词汇表示。
Psychon Bull Rev. 2019 Feb;26(1):103-126. doi: 10.3758/s13423-018-1501-2.
5
Word prevalence norms for 62,000 English lemmas.62000 个英语词汇的词频规范。
Behav Res Methods. 2019 Apr;51(2):467-479. doi: 10.3758/s13428-018-1077-9.
6
Naturalistic multiattribute choice.自然主义多属性选择。
Cognition. 2018 Oct;179:71-88. doi: 10.1016/j.cognition.2018.05.025. Epub 2018 Jun 15.
7
A Large-Scale Analysis of Variance in Written Language.书面语言的大规模方差分析
Cogn Sci. 2018 May;42(4):1360-1374. doi: 10.1111/cogs.12583. Epub 2018 Jan 22.
8
Estimating the average need of semantic knowledge from distributional semantic models.从分布语义模型估计语义知识的平均需求。
Mem Cognit. 2017 Nov;45(8):1350-1370. doi: 10.3758/s13421-017-0732-1.
9
Release from PI: An analysis and a model.脱离 PI:分析与模型。
Psychon Bull Rev. 2018 Jun;25(3):932-950. doi: 10.3758/s13423-017-1327-3.
10
Compounding as Abstract Operation in Semantic Space: Investigating relational effects through a large-scale, data-driven computational model.语义空间中作为抽象操作的复合:通过大规模数据驱动计算模型研究关系效应
Cognition. 2017 Sep;166:207-224. doi: 10.1016/j.cognition.2017.05.026. Epub 2017 Jun 3.