人类与机器的句子处理：大型语言模型作为更好理解人类阅读的工具

Sentence processing by humans and machines: Large language models as a tool to better understand human reading.

作者信息

Kaye Nikki G, Gordon Peter C

机构信息

Department of Psychology & Neuroscience, The University of North Carolina at Chapel Hill, CB#3270, Chapel Hill, NC, 26599-3270, USA.

出版信息

Psychon Bull Rev. 2025 Aug 13. doi: 10.3758/s13423-025-02756-9.

DOI:10.3758/s13423-025-02756-9

PMID:40804188

Abstract

Online measures of reading have been studied with the goal of understanding how humans process language incrementally as they progress through a text. A focus of this research has been on pinpointing how the context of a word influences its processing. Quantitatively measuring the effects of context has proven difficult but with advances in artificial intelligence, large language models (LLMs) are more capable of generating humanlike language, drawing solely on information about the probabilistic relationships of units of language (e.g., words) occurring together. LLMs can be used to estimate the probability of any word in the model's vocabulary occurring as the next word in a given context. These next-word probabilities can be used in the calculation of information theoretic metrics, such as entropy and surprisal, which can be assessed as measures of word-by-word processing load. This is done by analyzing whether entropy and surprisal derived from language models predict variance in online measures of human reading comprehension (e.g., eye-movement, self-paced reading, ERP data). The present review synthesizes empirical findings on this topic and evaluates their methodological and theoretical implications.

摘要

对在线阅读测量进行了研究，目的是了解人类在阅读文本过程中如何逐步处理语言。这项研究的一个重点是确定单词的上下文如何影响其处理过程。事实证明，定量测量上下文的影响很困难，但随着人工智能的发展，大型语言模型（LLMs）更有能力生成类人语言，仅依靠关于一起出现的语言单位（如单词）的概率关系的信息。大型语言模型可用于估计模型词汇表中任何单词在给定上下文中作为下一个单词出现的概率。这些下一个单词的概率可用于计算信息理论指标，如熵和意外性，它们可作为逐词处理负荷的度量进行评估。这是通过分析语言模型得出的熵和意外性是否能预测人类阅读理解的在线测量（如眼动、自定步速阅读、ERP数据）中的差异来实现的。本综述综合了关于该主题的实证研究结果，并评估了它们的方法学和理论意义。

相似文献

Sentence processing by humans and machines: Large language models as a tool to better understand human reading.人类与机器的句子处理：大型语言模型作为更好理解人类阅读的工具

Psychon Bull Rev. 2025 Aug 13. doi: 10.3758/s13423-025-02756-9.

Short-Term Memory Impairment短期记忆障碍

The agreement of phonetic transcriptions between paediatric speech and language therapists transcribing a disordered speech sample.儿科言语和语言治疗师转写语音样本的音标转录的一致性。

Int J Lang Commun Disord. 2024 Sep-Oct;59(5):1981-1995. doi: 10.1111/1460-6984.13043. Epub 2024 Jun 8.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中，如果患者出现以下症状和体征，可判断其是否患有 COVID-19。

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

Methodologies for assessing morphosyntactic ability in people with Alzheimer's disease.评估阿尔茨海默病患者形态句法能力的方法。

Int J Lang Commun Disord. 2024 Jan-Feb;59(1):38-57. doi: 10.1111/1460-6984.12862. Epub 2023 Feb 25.

The Lived Experience of Autistic Adults in Employment: A Systematic Search and Synthesis.成年自闭症患者的就业生活经历：系统检索与综述

Autism Adulthood. 2024 Dec 2;6(4):495-509. doi: 10.1089/aut.2022.0114. eCollection 2024 Dec.

Computational Sentence-Level Metrics of Reading Speed and Its Ramifications for Sentence Comprehension.阅读速度的计算句子级指标及其对句子理解的影响。

Cogn Sci. 2025 Jul;49(7):e70092. doi: 10.1111/cogs.70092.

Sexual Harassment and Prevention Training性骚扰与预防培训

Identifying and Addressing Bullying识别与应对霸凌

A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。

Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.

本文引用的文献

Language models outperform cloze predictability in a cognitive model of reading.语言模型在阅读认知模型中优于完形预测能力。

PLoS Comput Biol. 2024 Sep 25;20(9):e1012117. doi: 10.1371/journal.pcbi.1012117. eCollection 2024 Sep.

On the Mathematical Relationship Between Contextual Probability and N400 Amplitude.关于情境概率与N400波幅之间的数学关系。

Open Mind (Camb). 2024 Jun 28;8:859-897. doi: 10.1162/opmi_a_00150. eCollection 2024.

Strong Prediction: Language Model Surprisal Explains Multiple N400 Effects.强预测：语言模型意外值解释多种N400效应。

Neurobiol Lang (Camb). 2024 Apr 1;5(1):107-135. doi: 10.1162/nol_a_00105. eCollection 2024.

Large-scale evidence for logarithmic effects of word predictability on reading time.大规模证据表明，单词可预测性对阅读时间的影响呈对数关系。

Proc Natl Acad Sci U S A. 2024 Mar 5;121(10):e2307876121. doi: 10.1073/pnas.2307876121. Epub 2024 Feb 29.

Cloze probability, predictability ratings, and computational estimates for 205 English sentences, aligned with existing EEG and reading time data.205 个与现有 EEG 和阅读时间数据对齐的英语句子的 cloze 概率、可预测性评分和计算估计值。

Behav Res Methods. 2024 Aug;56(5):5190-5213. doi: 10.3758/s13428-023-02261-8. Epub 2023 Oct 25.

Multiple predictions during language comprehension: Friends, foes, or indifferent companions?语言理解中的多重预测：朋友、敌人，还是漠不关心的伙伴？

Cognition. 2023 Dec;241:105602. doi: 10.1016/j.cognition.2023.105602. Epub 2023 Sep 14.

Ignoring the alternatives: The N400 is sensitive to stimulus preactivation alone.忽略其他因素：N400仅对刺激预激活敏感。

Cortex. 2023 Nov;168:82-101. doi: 10.1016/j.cortex.2023.08.001. Epub 2023 Aug 14.

The Plausibility of Sampling as an Algorithmic Theory of Sentence Processing.抽样作为句子处理算法理论的合理性。

Open Mind (Camb). 2023 Jul 21;7:350-391. doi: 10.1162/opmi_a_00086. eCollection 2023.

Eye Movement Traces of Linguistic Knowledge in Native and Non-Native Reading.母语和非母语阅读中语言知识的眼动轨迹

Open Mind (Camb). 2023 Jun 5;7:179-196. doi: 10.1162/opmi_a_00084. eCollection 2023.

A study on surprisal and semantic relatedness for eye-tracking data prediction.一项关于用于眼动追踪数据预测的意外性和语义相关性的研究。

Front Psychol. 2023 Feb 2;14:1112365. doi: 10.3389/fpsyg.2023.1112365. eCollection 2023.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

人类与机器的句子处理：大型语言模型作为更好理解人类阅读的工具

Sentence processing by humans and machines: Large language models as a tool to better understand human reading.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献