• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大规模证据表明,单词可预测性对阅读时间的影响呈对数关系。

Large-scale evidence for logarithmic effects of word predictability on reading time.

机构信息

Department of Brain & Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139.

Department of Computer Science, Institute for Machine Learning, ETH Zürich, Zürich 8092, Schweiz.

出版信息

Proc Natl Acad Sci U S A. 2024 Mar 5;121(10):e2307876121. doi: 10.1073/pnas.2307876121. Epub 2024 Feb 29.

DOI:10.1073/pnas.2307876121
PMID:38422017
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10927576/
Abstract

During real-time language comprehension, our minds rapidly decode complex meanings from sequences of words. The difficulty of doing so is known to be related to words' contextual predictability, but what cognitive processes do these predictability effects reflect? In one view, predictability effects reflect facilitation due to anticipatory processing of words that are predictable from context. This view predicts a linear effect of predictability on processing demand. In another view, predictability effects reflect the costs of probabilistic inference over sentence interpretations. This view predicts either a logarithmic or a superlogarithmic effect of predictability on processing demand, depending on whether it assumes pressures toward a uniform distribution of information over time. The empirical record is currently mixed. Here, we revisit this question at scale: We analyze six reading datasets, estimate next-word probabilities with diverse statistical language models, and model reading times using recent advances in nonlinear regression. Results support a logarithmic effect of word predictability on processing difficulty, which favors probabilistic inference as a key component of human language processing.

摘要

在实时语言理解过程中,我们的大脑会迅速从单词序列中解码出复杂的含义。众所周知,这种理解的难度与单词的上下文可预测性有关,但这些可预测性效应反映了哪些认知过程呢?有一种观点认为,可预测性效应反映了由于对可以从上下文中预测到的单词进行预期处理而产生的促进作用。这种观点预测了可预测性对处理需求的线性影响。另一种观点认为,可预测性效应反映了对句子解释进行概率推理的成本。这种观点预测了可预测性对处理需求的对数或超对数效应,具体取决于它是否假设随着时间的推移,信息在均匀分布上的压力。目前,实证记录喜忧参半。在这里,我们从大规模上重新审视这个问题:我们分析了六个阅读数据集,使用不同的统计语言模型来估计下一个单词的概率,并使用非线性回归的最新进展来对阅读时间进行建模。结果支持单词可预测性对处理难度的对数效应,这有利于概率推理成为人类语言处理的关键组成部分。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50de/10927576/762375b3f2d8/pnas.2307876121fig04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50de/10927576/e50a63181553/pnas.2307876121fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50de/10927576/ed37b8aaa324/pnas.2307876121fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50de/10927576/9ebed88f5104/pnas.2307876121fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50de/10927576/762375b3f2d8/pnas.2307876121fig04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50de/10927576/e50a63181553/pnas.2307876121fig01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50de/10927576/ed37b8aaa324/pnas.2307876121fig02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50de/10927576/9ebed88f5104/pnas.2307876121fig03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/50de/10927576/762375b3f2d8/pnas.2307876121fig04.jpg

相似文献

1
Large-scale evidence for logarithmic effects of word predictability on reading time.大规模证据表明,单词可预测性对阅读时间的影响呈对数关系。
Proc Natl Acad Sci U S A. 2024 Mar 5;121(10):e2307876121. doi: 10.1073/pnas.2307876121. Epub 2024 Feb 29.
2
The effect of word predictability on reading time is logarithmic.词的可预测性对阅读时间的影响是对数的。
Cognition. 2013 Sep;128(3):302-19. doi: 10.1016/j.cognition.2013.02.013. Epub 2013 Jun 6.
3
Word predictability effects are linear, not logarithmic: Implications for probabilistic models of sentence comprehension.词汇可预测性效应是线性的,而非对数性的:对句子理解概率模型的启示。
J Mem Lang. 2021 Feb;116. doi: 10.1016/j.jml.2020.104174. Epub 2020 Sep 18.
4
Word Frequency and Predictability Dissociate in Naturalistic Reading.自然阅读中单词频率与可预测性相互分离。
Open Mind (Camb). 2024 Mar 5;8:177-201. doi: 10.1162/opmi_a_00119. eCollection 2024.
5
Dissociable effects of prediction and integration during language comprehension: evidence from a large-scale study using brain potentials.语言理解过程中预测和整合的可分离效应:一项使用脑电波的大规模研究证据。
Philos Trans R Soc Lond B Biol Sci. 2020 Feb 3;375(1791):20180522. doi: 10.1098/rstb.2018.0522. Epub 2019 Dec 16.
6
Language Models Explain Word Reading Times Better Than Empirical Predictability.语言模型比经验可预测性能更好地解释单词阅读时间。
Front Artif Intell. 2022 Feb 2;4:730570. doi: 10.3389/frai.2021.730570. eCollection 2021.
7
Oral reading promotes predictive processing in Chinese sentence reading: eye movement evidence.口语阅读促进中文句子阅读中的预测加工:眼动证据。
PeerJ. 2024 Oct 17;12:e18307. doi: 10.7717/peerj.18307. eCollection 2024.
8
A further look at ageing and word predictability effects in Chinese reading: Evidence from one-character words.进一步探讨中文阅读中的老化和词汇可预测性效应:来自单字的证据。
Q J Exp Psychol (Hove). 2021 Jan;74(1):68-76. doi: 10.1177/1747021820951131. Epub 2020 Sep 11.
9
Semantic similarity, predictability, and models of sentence processing.语义相似度、可预测性与句子处理模型。
Cognition. 2012 Mar;122(3):267-79. doi: 10.1016/j.cognition.2011.11.011. Epub 2011 Dec 23.
10
Prior Context and Individual Alpha Frequency Influence Predictive Processing during Language Comprehension.先前语境和个体阿尔法频率影响语言理解中的预测加工。
J Cogn Neurosci. 2024 Sep 1;36(9):1898-1936. doi: 10.1162/jocn_a_02196.

引用本文的文献

1
A systematic evaluation of Dutch large language models' surprisal estimates in sentence, paragraph and book reading.对荷兰大语言模型在句子、段落和书籍阅读中的意外度估计进行的系统评估。
Behav Res Methods. 2025 Aug 18;57(9):266. doi: 10.3758/s13428-025-02774-4.
2
Sentence processing by humans and machines: Large language models as a tool to better understand human reading.人类与机器的句子处理:大型语言模型作为更好理解人类阅读的工具
Psychon Bull Rev. 2025 Aug 13. doi: 10.3758/s13423-025-02756-9.
3
A two-dimensional space of linguistic representations shared across individuals.

本文引用的文献

1
A Deep Learning Approach to Analyzing Continuous-Time Cognitive Processes.一种用于分析连续时间认知过程的深度学习方法。
Open Mind (Camb). 2024 Mar 13;8:235-264. doi: 10.1162/opmi_a_00126. eCollection 2024.
2
Word Frequency and Predictability Dissociate in Naturalistic Reading.自然阅读中单词频率与可预测性相互分离。
Open Mind (Camb). 2024 Mar 5;8:177-201. doi: 10.1162/opmi_a_00119. eCollection 2024.
3
The Plausibility of Sampling as an Algorithmic Theory of Sentence Processing.抽样作为句子处理算法理论的合理性。
个体间共享的语言表征二维空间。
bioRxiv. 2025 May 23:2025.05.21.655330. doi: 10.1101/2025.05.21.655330.
4
The timing of spontaneous eye blinks in text reading suggests cognitive role.文本阅读中自发眨眼的时机表明其具有认知作用。
Sci Rep. 2025 Jun 5;15(1):19849. doi: 10.1038/s41598-025-04839-y.
5
Comparison of Large Language Model with Aphasia.大语言模型与失语症的比较。
Adv Sci (Weinh). 2025 Jun;12(22):e2414016. doi: 10.1002/advs.202414016. Epub 2025 May 14.
6
Expectation violations signal goals in novel human communication.期望违背在新颖的人际交流中标志着目标。
Nat Commun. 2025 Feb 26;16(1):1989. doi: 10.1038/s41467-025-57025-z.
7
Prediction in reading: A review of predictability effects, their theoretical implications, and beyond.阅读中的预测:可预测性效应、其理论意义及其他方面的综述
Psychon Bull Rev. 2025 Jun;32(3):973-1006. doi: 10.3758/s13423-024-02588-z. Epub 2024 Oct 31.
8
Language models outperform cloze predictability in a cognitive model of reading.语言模型在阅读认知模型中优于完形预测能力。
PLoS Comput Biol. 2024 Sep 25;20(9):e1012117. doi: 10.1371/journal.pcbi.1012117. eCollection 2024 Sep.
9
On the Mathematical Relationship Between Contextual Probability and N400 Amplitude.关于情境概率与N400波幅之间的数学关系。
Open Mind (Camb). 2024 Jun 28;8:859-897. doi: 10.1162/opmi_a_00150. eCollection 2024.
10
Clinical efficacy of pre-trained large language models through the lens of aphasia.从失语症的角度看预先训练的大型语言模型的临床疗效。
Sci Rep. 2024 Jul 6;14(1):15573. doi: 10.1038/s41598-024-66576-y.
Open Mind (Camb). 2023 Jul 21;7:350-391. doi: 10.1162/opmi_a_00086. eCollection 2023.
4
Context-based facilitation of semantic access follows both logarithmic and linear functions of stimulus probability.基于上下文的语义通达促进作用遵循刺激概率的对数函数和线性函数。
J Mem Lang. 2022 Apr;123. doi: 10.1016/j.jml.2021.104311. Epub 2021 Dec 20.
5
A resource-rational model of human processing of recursive linguistic structure.递归语言结构的人类处理的资源理性模型。
Proc Natl Acad Sci U S A. 2022 Oct 25;119(43):e2122602119. doi: 10.1073/pnas.2122602119. Epub 2022 Oct 19.
6
Robust Effects of Working Memory Demand during Naturalistic Language Comprehension in Language-Selective Cortex.自然语言理解过程中工作记忆需求对语言选择皮层的强大影响。
J Neurosci. 2022 Sep 28;42(39):7412-7430. doi: 10.1523/JNEUROSCI.1894-21.2022.
7
A hierarchy of linguistic predictions during natural language comprehension.自然语言理解过程中的语言预测层次。
Proc Natl Acad Sci U S A. 2022 Aug 9;119(32):e2201968119. doi: 10.1073/pnas.2201968119. Epub 2022 Aug 3.
8
Comparison of Structural Parsers and Neural Language Models as Surprisal Estimators.作为惊奇度估计器的结构解析器与神经语言模型的比较。
Front Artif Intell. 2022 Mar 3;5:777963. doi: 10.3389/frai.2022.777963. eCollection 2022.
9
Shared computational principles for language processing in humans and deep language models.人类和深度语言模型语言处理的共享计算原则。
Nat Neurosci. 2022 Mar;25(3):369-380. doi: 10.1038/s41593-022-01026-4. Epub 2022 Mar 7.
10
Language Models Explain Word Reading Times Better Than Empirical Predictability.语言模型比经验可预测性能更好地解释单词阅读时间。
Front Artif Intell. 2022 Feb 2;4:730570. doi: 10.3389/frai.2021.730570. eCollection 2021.