• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

超越基于计数的词频:人工智能生成的单词和短语熟悉度估计是语言知识的一个有趣的附加指标。

Moving beyond word frequency based on tally counting: AI-generated familiarity estimates of words and phrases are an interesting additional index of language knowledge.

作者信息

Brysbaert Marc, Martínez Gonzalo, Reviriego Pedro

机构信息

Department of Experimental Psychology, Ghent University, 9000, Ghent, Belgium.

Universidad Carlos III de Madrid, Avenida de la Universidad, 30, 28911, Leganés, Madrid, Spain.

出版信息

Behav Res Methods. 2024 Dec 28;57(1):28. doi: 10.3758/s13428-024-02561-7.

DOI:10.3758/s13428-024-02561-7
PMID:39733132
Abstract

This study investigates the potential of large language models (LLMs) to estimate the familiarity of words and multi-word expressions (MWEs). We validated LLM estimates for isolated words using existing human familiarity ratings and found strong correlations. LLM familiarity estimates performed even better in predicting lexical decision and naming performance in megastudies than the best available word frequency measures. We then applied LLM estimates to MWEs, also finding their effectiveness in measuring familiarity for these expressions. We have created a list of more than 400,000 English words and MWEs with LLM-generated familiarity estimates, which we hope will be a valuable resource for researchers. There is also a cleaned-up list of nearly 150,000 entries, excluding lesser-known stimuli, to streamline stimulus selection. Our findings highlight the advantages of LLM-based familiarity estimates, including their better performance than traditional word frequency measures (particularly for predicting word recognition accuracy), their ability to generalize to MWEs, availability for large lists of words, and ease of obtaining new estimates for all types of stimuli.

摘要

本研究调查了大语言模型(LLMs)估计单词和多词表达式(MWEs)熟悉度的潜力。我们使用现有的人类熟悉度评分验证了大语言模型对孤立单词的估计,并发现了很强的相关性。在大型研究中,大语言模型的熟悉度估计在预测词汇判断和命名表现方面比现有的最佳词频指标表现更好。然后,我们将大语言模型的估计应用于多词表达式,也发现它们在测量这些表达式的熟悉度方面是有效的。我们创建了一个包含超过40万个英语单词和多词表达式的列表,并给出了由大语言模型生成的熟悉度估计,我们希望这将成为研究人员的宝贵资源。还有一个经过清理的列表,包含近15万个条目,排除了不太知名的刺激词,以简化刺激词的选择。我们的研究结果突出了基于大语言模型的熟悉度估计的优势,包括它们比传统词频指标表现更好(特别是在预测单词识别准确性方面),能够推广到多词表达式,可用于大量单词列表,以及易于获得所有类型刺激词的新估计。

相似文献

1
Moving beyond word frequency based on tally counting: AI-generated familiarity estimates of words and phrases are an interesting additional index of language knowledge.超越基于计数的词频:人工智能生成的单词和短语熟悉度估计是语言知识的一个有趣的附加指标。
Behav Res Methods. 2024 Dec 28;57(1):28. doi: 10.3758/s13428-024-02561-7.
2
Using large language models to estimate features of multi-word expressions: Concreteness, valence, arousal.使用大语言模型估计多词表达的特征:具体性、效价、唤醒度。
Behav Res Methods. 2024 Dec 4;57(1):5. doi: 10.3758/s13428-024-02515-z.
3
AI-generated estimates of familiarity, concreteness, valence, and arousal for over 100,000 Spanish words.人工智能对超过10万个西班牙语单词的熟悉度、具体性、效价和唤醒度的估计。
Q J Exp Psychol (Hove). 2024 Dec 24:17470218241306694. doi: 10.1177/17470218241306694.
4
A database of 629 English compound words: ratings of familiarity, lexeme meaning dominance, semantic transparency, age of acquisition, imageability, and sensory experience.629 个英语复合词数据库:熟悉度评分、词元意义主导性、语义透明度、习得年龄、形象性和感官体验。
Behav Res Methods. 2015 Dec;47(4):1004-1019. doi: 10.3758/s13428-014-0523-6.
5
Familiarity ratings for 24,325 simplified Chinese words.24325个简体中文字的熟悉度评级
Behav Res Methods. 2023 Apr;55(3):1496-1509. doi: 10.3758/s13428-022-01878-5. Epub 2022 Jun 6.
6
Do the effects of subjective frequency and age of acquisition survive better word frequency norms?主观频率和习得年龄的影响在更好的词频规范下是否依然存在?
Q J Exp Psychol (Hove). 2011 Mar;64(3):545-59. doi: 10.1080/17470218.2010.503374. Epub 2010 Aug 9.
7
Decoding the essence of two-character Chinese words: Unveiling valence, arousal, concreteness, familiarity, and imageability through word norming.解析二字词的本质:通过词频规范揭示词义、情感、具体性、熟悉度和形象度。
Behav Res Methods. 2024 Oct;56(7):7574-7601. doi: 10.3758/s13428-024-02437-w. Epub 2024 May 15.
8
The Italian Crowdsourcing Project: Visual word recognition times for 130,495 Italian words.意大利众包项目:130495个意大利语单词的视觉单词识别时间
Behav Res Methods. 2024 Dec 28;57(1):26. doi: 10.3758/s13428-024-02548-4.
9
Large Language Models in Worldwide Medical Exams: Platform Development and Comprehensive Analysis.全球医学考试中的大语言模型:平台开发与综合分析
J Med Internet Res. 2024 Dec 27;26:e66114. doi: 10.2196/66114.
10
Lexical and semantic age-of-acquisition effects on word naming in Spanish.词汇和语义习得年龄对西班牙语单词命名的影响。
Mem Cognit. 2013 Feb;41(2):297-311. doi: 10.3758/s13421-012-0263-8.

引用本文的文献

1
A systematic evaluation of Dutch large language models' surprisal estimates in sentence, paragraph and book reading.对荷兰大语言模型在句子、段落和书籍阅读中的意外度估计进行的系统评估。
Behav Res Methods. 2025 Aug 18;57(9):266. doi: 10.3758/s13428-025-02774-4.

本文引用的文献

1
Word Forms Reflect Trade-Offs Between Speaker Effort and Robust Listener Recognition.词形反映说话者努力与稳健听话者识别之间的权衡。
Cogn Sci. 2024 Jul;48(7):e13478. doi: 10.1111/cogs.13478.
2
Large Language Models and the Wisdom of Small Crowds.大语言模型与小众群体的智慧
Open Mind (Camb). 2024 May 20;8:723-738. doi: 10.1162/opmi_a_00144. eCollection 2024.
3
Artificial intelligence and illusions of understanding in scientific research.人工智能与科研中的理解错觉。
Nature. 2024 Mar;627(8002):49-58. doi: 10.1038/s41586-024-07146-0. Epub 2024 Mar 6.
4
Can large language models help augment English psycholinguistic datasets?大型语言模型能否帮助扩充英语心理语言学数据集?
Behav Res Methods. 2024 Sep;56(6):6082-6100. doi: 10.3758/s13428-024-02337-z. Epub 2024 Jan 23.
5
Lexical Processing Strongly Affects Reading Times But Not Skipping During Natural Reading.词汇处理对自然阅读过程中的阅读时间有强烈影响,但对跳读没有影响。
Open Mind (Camb). 2023 Oct 1;7:757-783. doi: 10.1162/opmi_a_00099. eCollection 2023.
6
The Children's Picture Books Lexicon (CPB-LEX): A large-scale lexical database from children's picture books.《儿童图画书词汇表》(CPB-LEX):一个来自儿童图画书的大规模词汇数据库。
Behav Res Methods. 2024 Aug;56(5):4504-4521. doi: 10.3758/s13428-023-02198-y. Epub 2023 Aug 11.
7
Comparing word frequency, semantic diversity, and semantic distinctiveness in lexical organization.比较词汇组织中的词频、语义多样性和语义独特性。
J Exp Psychol Gen. 2023 Jun;152(6):1814-1823. doi: 10.1037/xge0001407.
8
Can AI language models replace human participants?人工智能语言模型能否替代人类参与者?
Trends Cogn Sci. 2023 Jul;27(7):597-600. doi: 10.1016/j.tics.2023.04.008. Epub 2023 May 10.
9
Iconicity ratings for 14,000+ English words.一万四千多个英语单词的象似性评分。
Behav Res Methods. 2024 Mar;56(3):1640-1655. doi: 10.3758/s13428-023-02112-6. Epub 2023 Apr 20.
10
SCOPE: The South Carolina psycholinguistic metabase.范围:南卡罗来纳州心理语言学元数据库。
Behav Res Methods. 2023 Sep;55(6):2853-2884. doi: 10.3758/s13428-022-01934-0. Epub 2022 Aug 15.