Sun Kun, Wang Rong
School of Foreign Languages, Tongji University.
Department of Linguistics, University of Tübingen.
Cogn Sci. 2025 Jul;49(7):e70092. doi: 10.1111/cogs.70092.
The majority of research on sentence processing in computational psycholinguistics has focused on word-by-word incremental processing within sentences rather than on holistic sentence-level representations. This study introduces two novel computational approaches to quantifying sentence-level processing: sentence surprisal and sentence relevance. Using multilingual large language models (LLMs), we compute sentence surprisal through three methods: the chain rule, next sentence prediction, and negative log-likelihood. We also apply a "memory-aware" approach based on convolution operations to compute sentence-level semantic relevance. We test and compare the resulting metrics to validate whether they can predict sentence reading speed, and we further explore how they affect how humans process and comprehend sentences as a whole across languages. The results show that the sentence-level metrics predict sentence reading speed well and are effective at predicting and explaining the processing difficulty readers encounter when processing whole sentences across a variety of languages. The proposed metrics offer strong interpretability and achieve high accuracy in predicting human sentence reading speed, as they capture aspects of comprehension difficulty beyond word-level measures. They serve as valuable computational tools for investigating human sentence processing and for advancing our understanding of naturalistic reading, and their strong performance and generalization highlight their potential to drive progress at the intersection of LLMs and cognitive science.
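As an illustration of the chain-rule computation of sentence surprisal described above, the following minimal Python sketch sums token-level surprisals under a causal multilingual LM, which by the chain rule equals the sentence's negative log-likelihood. The choice of model (ai-forever/mGPT) and the use of the Hugging Face transformers API are assumptions for illustration, not the authors' released code.

import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

model_name = "ai-forever/mGPT"  # assumption: any multilingual causal LM would do
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()

def sentence_surprisal(sentence: str) -> float:
    # Chain rule: surprisal(s) = -sum_i log P(w_i | w_<i), i.e. the sentence NLL.
    ids = tok(sentence, return_tensors="pt").input_ids
    with torch.no_grad():
        out = model(ids, labels=ids)  # with labels, .loss is the mean per-token NLL
    n_predicted = ids.size(1) - 1  # the first token has no left context
    return out.loss.item() * n_predicted  # total surprisal in nats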
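For the convolution-based semantic relevance, the following sketch shows one plausible reading of a "memory-aware" computation: the relevance of sentence t is the cosine similarity between its embedding and a decay-weighted (convolved) sum of the preceding sentence embeddings, so that more recent sentences contribute more, as a fading memory would. The window size, decay kernel, and function name are illustrative assumptions, not the paper's exact formulation.

import numpy as np

def memory_aware_relevance(embeddings: np.ndarray, window: int = 3,
                           decay: float = 0.5) -> np.ndarray:
    # embeddings: (n_sentences, dim) array of sentence embeddings.
    n = embeddings.shape[0]
    kernel = np.array([decay ** k for k in range(1, window + 1)])  # nearer = heavier
    rel = np.zeros(n)  # the first sentence has no preceding context, so rel[0] = 0
    for t in range(1, n):
        ctx = embeddings[max(0, t - window):t][::-1]  # most recent sentence first
        w = kernel[:ctx.shape[0]]
        memory = (w[:, None] * ctx).sum(axis=0) / w.sum()  # convolved context vector
        cos = memory @ embeddings[t] / (
            np.linalg.norm(memory) * np.linalg.norm(embeddings[t]) + 1e-9)
        rel[t] = cos
    return rel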