
Language Models Explain Word Reading Times Better Than Empirical Predictability.

Author Information

Hofmann Markus J, Remus Steffen, Biemann Chris, Radach Ralph, Kuchinke Lars

Affiliations

Department of Psychology, University of Wuppertal, Wuppertal, Germany.

Department of Informatics, Universität Hamburg, Hamburg, Germany.

Publication Information

Front Artif Intell. 2022 Feb 2;4:730570. doi: 10.3389/frai.2021.730570. eCollection 2021.

Abstract

Though there is a strong consensus that word length and frequency are the most important single-word features determining visual-orthographic access to the mental lexicon, there is less agreement as to how best to capture syntactic and semantic factors. The traditional approach in cognitive reading research assumes that word predictability from sentence context is best captured by cloze completion probability (CCP) derived from human performance data. We review recent research suggesting that probabilistic language models provide deeper explanations for syntactic and semantic effects than CCP. We then compare CCP with three probabilistic language models for predicting word viewing times in an English and a German eye-tracking sample: (1) Symbolic n-gram models consolidate syntactic and semantic short-range relations by computing the probability of a word occurring given the two preceding words. (2) Topic models rely on subsymbolic representations to capture long-range semantic similarity via word co-occurrence counts in documents. (3) In recurrent neural networks (RNNs), the subsymbolic units are trained to predict the next word given all preceding words in the sentence. To examine lexical retrieval, these models were used to predict single fixation durations and gaze durations, capturing rapidly successful and standard lexical access, as well as total viewing time, capturing late semantic integration. The linear item-level analyses showed greater correlations of all language models with all eye-movement measures than CCP. We then examined non-linear relations between the different types of predictability and the reading times using generalized additive models. N-gram and RNN probabilities of the present word predicted reading performance more consistently than topic models or CCP. For the effects of last-word probability on current-word viewing times, we obtained the best results with n-gram models. Such count-based models seem to best capture short-range access that is still underway when the eyes move on to the subsequent word. The prediction-trained RNN models, in contrast, better predicted early preprocessing of the next word. In sum, our results demonstrate that the different language models account for differential cognitive processes during reading. We discuss these algorithmically concrete blueprints of lexical consolidation as theoretically deep explanations of human reading.
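To make the count-based notion of predictability concrete, the following minimal Python sketch estimates a word's probability from the two preceding words, in the spirit of the trigram (n-gram) models compared in the abstract. The toy corpus, add-alpha smoothing, and vocabulary size are illustrative assumptions, not the authors' actual training setup or corpus.

# Minimal sketch of a count-based trigram predictability estimate:
# P(word | two preceding words), as described for the n-gram models above.
# Corpus, smoothing, and vocabulary size are hypothetical assumptions.
from collections import defaultdict

def train_trigram_counts(sentences):
    """Count bigram and trigram occurrences over tokenised sentences."""
    bigram_counts = defaultdict(int)
    trigram_counts = defaultdict(int)
    for tokens in sentences:
        padded = ["<s>", "<s>"] + tokens + ["</s>"]
        for i in range(2, len(padded)):
            bigram_counts[(padded[i - 2], padded[i - 1])] += 1
            trigram_counts[(padded[i - 2], padded[i - 1], padded[i])] += 1
    return bigram_counts, trigram_counts

def trigram_probability(w_prev2, w_prev1, w, bigram_counts, trigram_counts,
                        alpha=0.1, vocab_size=10000):
    """P(w | w_prev2, w_prev1) with simple add-alpha smoothing
    (an assumption; the paper's smoothing scheme may differ)."""
    tri = trigram_counts[(w_prev2, w_prev1, w)]
    bi = bigram_counts[(w_prev2, w_prev1)]
    return (tri + alpha) / (bi + alpha * vocab_size)

if __name__ == "__main__":
    corpus = [
        "the eyes move on to the next word".split(),
        "the eyes move quickly across the line".split(),
    ]
    bi, tri = train_trigram_counts(corpus)
    # Predictability of "on" given the two preceding words "eyes move"
    print(trigram_probability("eyes", "move", "on", bi, tri))

Such probabilities (or their log transforms) can then serve as item-level predictors of fixation measures, analogous to the linear and generalized additive analyses described above.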


Figure: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e431/8847793/e6cae5821f6c/frai-04-730570-g0001.jpg
