• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

无监督文本分割可预测阅读过程中的眼动注视。

Unsupervised Text Segmentation Predicts Eye Fixations During Reading.

作者信息

Yang Jinbiao, van den Bosch Antal, Frank Stefan L

机构信息

Max Planck Institute for Psycholinguistics, Nijmegen, Netherlands.

Centre for Language Studies, Radboud University, Nijmegen, Netherlands.

出版信息

Front Artif Intell. 2022 Feb 23;5:731615. doi: 10.3389/frai.2022.731615. eCollection 2022.

DOI:10.3389/frai.2022.731615
PMID:35280234
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8905434/
Abstract

Words typically form the basis of psycholinguistic and computational linguistic studies about sentence processing. However, recent evidence shows the basic units during reading, i.e., the items in the mental lexicon, are not always words, but could also be sub-word and supra-word units. To recognize these units, human readers require a cognitive mechanism to learn and detect them. In this paper, we assume eye fixations during reading reveal the locations of the cognitive units, and that the cognitive units are analogous with the text units discovered by unsupervised segmentation models. We predict eye fixations by model-segmented units on both English and Dutch text. The results show the model-segmented units predict eye fixations better than word units. This finding suggests that the predictive performance of model-segmented units indicates their plausibility as cognitive units. The Less-is-Better (LiB) model, which finds the units that minimize both long-term and working memory load, offers advantages both in terms of prediction score and efficiency among alternative models. Our results also suggest that modeling the least-effort principle for the management of long-term and working memory can lead to inferring cognitive units. Overall, the study supports the theory that the mental lexicon stores not only words but also smaller and larger units, suggests that fixation locations during reading depend on these units, and shows that unsupervised segmentation models can discover these units.

摘要

单词通常构成了关于句子处理的心理语言学和计算语言学研究的基础。然而,最近的证据表明,阅读过程中的基本单位,即心理词典中的条目,并不总是单词,也可能是子词和超词单位。为了识别这些单位,人类读者需要一种认知机制来学习和检测它们。在本文中,我们假设阅读过程中的眼动注视揭示了认知单位的位置,并且认知单位与无监督分割模型发现的文本单位类似。我们用模型分割的单位对英语和荷兰语文本的眼动注视进行预测。结果表明,模型分割的单位比单词单位能更好地预测眼动注视。这一发现表明,模型分割单位的预测性能表明了它们作为认知单位的合理性。“少即是优”(LiB)模型,即找到能使长期记忆和工作记忆负荷最小化的单位,在预测得分和效率方面比其他模型都具有优势。我们的结果还表明,对管理长期记忆和工作记忆的最少努力原则进行建模可以推断出认知单位。总体而言,该研究支持了心理词典不仅存储单词,还存储更小和更大单位的理论,表明阅读过程中的注视位置取决于这些单位,并表明无监督分割模型可以发现这些单位。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c439/8905434/ae9cd6c7d556/frai-05-731615-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c439/8905434/c6f66f730450/frai-05-731615-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c439/8905434/eac05c523a53/frai-05-731615-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c439/8905434/e9ebb52bd9d8/frai-05-731615-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c439/8905434/ae9cd6c7d556/frai-05-731615-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c439/8905434/c6f66f730450/frai-05-731615-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c439/8905434/eac05c523a53/frai-05-731615-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c439/8905434/e9ebb52bd9d8/frai-05-731615-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c439/8905434/ae9cd6c7d556/frai-05-731615-g0004.jpg

相似文献

1
Unsupervised Text Segmentation Predicts Eye Fixations During Reading.无监督文本分割可预测阅读过程中的眼动注视。
Front Artif Intell. 2022 Feb 23;5:731615. doi: 10.3389/frai.2022.731615. eCollection 2022.
2
An integrated model of word processing and eye-movement control during Chinese reading.中文阅读过程中文字处理和眼动控制的综合模型。
Psychol Rev. 2020 Nov;127(6):1139-1162. doi: 10.1037/rev0000248. Epub 2020 Jul 16.
3
Using Statistical Models of Morphology in the Search for Optimal Units of Representation in the Human Mental Lexicon.利用形态学统计模型探寻人类心理词典中最优的表征单元
Cogn Sci. 2018 Apr;42(3):939-973. doi: 10.1111/cogs.12576. Epub 2017 Dec 19.
4
Neural dynamics of sentiment processing during naturalistic sentence reading.自然阅读句子时情绪处理的神经动力学。
Neuroimage. 2020 Sep;218:116934. doi: 10.1016/j.neuroimage.2020.116934. Epub 2020 May 13.
5
Optimal landing position in reading isolated words and continuous text.阅读孤立单词和连续文本时的最佳着陆位置。
Percept Psychophys. 1990 Jun;47(6):583-600. doi: 10.3758/bf03203111.
6
Do Chinese readers follow the national standard rules for word segmentation during reading?中国读者在阅读时是否遵循分词的国家标准规则?
PLoS One. 2013;8(2):e55440. doi: 10.1371/journal.pone.0055440. Epub 2013 Feb 8.
7
The declarative/procedural model of lexicon and grammar.词汇与语法的陈述性/程序性模型
J Psycholinguist Res. 2001 Jan;30(1):37-69. doi: 10.1023/a:1005204207369.
8
Size of saccade and fixation duration of eye movements during reading: psychophysics of Japanese text processing.阅读过程中眼跳的大小和注视持续时间:日语文本处理的心理物理学
J Opt Soc Am A. 1992 Jan;9(1):5-13. doi: 10.1364/josaa.9.000005.
9
Information influences the pattern of eye fixations during sentence comprehension.
Perception. 1988;17(2):267-78. doi: 10.1068/p170267.
10
Integrating information across eye fixations in reading: the role of letter and word units.阅读中整合眼动注视点信息:字母和单词单元的作用。
Acta Psychol (Amst). 1990 Apr;73(3):281-97. doi: 10.1016/0001-6918(90)90027-d.

引用本文的文献

1
Individual Differences in Holistic and Compositional Language Processing.整体与组合式语言处理中的个体差异。
J Cogn. 2023 Jun 28;6(1):29. doi: 10.5334/joc.283. eCollection 2023.

本文引用的文献

1
How Do We Segment Text? Two-Stage Chunking Operation in Reading.我们如何对文本进行分段?阅读中的两阶段分块操作。
eNeuro. 2020 Jun 11;7(3). doi: 10.1523/ENEURO.0425-19.2020. Print 2020 May/Jun.
2
Hierarchical structure guides rapid linguistic predictions during naturalistic listening.层级结构指导自然聆听中的快速语言预测。
PLoS One. 2019 Jan 16;14(1):e0207741. doi: 10.1371/journal.pone.0207741. eCollection 2019.
3
Language learning as language use: A cross-linguistic model of child language development.语言学习即语言使用:儿童语言发展的跨语言模型。
Psychol Rev. 2019 Jan;126(1):1-51. doi: 10.1037/rev0000126.
4
Morphological processing in the brain: The good (inflection), the bad (derivation) and the ugly (compounding).大脑中的形态加工:好的(屈折变化)、坏的(派生)和丑的(复合)。
Cortex. 2019 Jul;116:4-44. doi: 10.1016/j.cortex.2018.08.016. Epub 2018 Sep 1.
5
Do Morphemes Matter when Reading Compound Words with Transposed Letters? Evidence from Eye-Tracking and Event-Related Potentials.阅读字母顺序颠倒的复合词时词素重要吗?来自眼动追踪和事件相关电位的证据。
Lang Cogn Neurosci. 2016;31(10):1299-1319. doi: 10.1080/23273798.2016.1212082. Epub 2016 Aug 6.
6
Computational Investigations of Multiword Chunks in Language Learning.语言学习中多词块的计算研究
Top Cogn Sci. 2017 Jul;9(3):637-652. doi: 10.1111/tops.12258. Epub 2017 May 8.
7
Presenting GECO: An eyetracking corpus of monolingual and bilingual sentence reading.介绍GECO:一个单语和双语句子阅读的眼动追踪语料库。
Behav Res Methods. 2017 Apr;49(2):602-615. doi: 10.3758/s13428-016-0734-0.
8
Cortical tracking of hierarchical linguistic structures in connected speech.连贯言语中层次语言结构的皮层追踪。
Nat Neurosci. 2016 Jan;19(1):158-64. doi: 10.1038/nn.4186. Epub 2015 Dec 7.
9
Distinct morphological processing of recently learned compound words: An ERP study.
Brain Res. 2015 Dec 10;1629:309-17. doi: 10.1016/j.brainres.2015.10.029. Epub 2015 Oct 23.
10
Effects of word length on eye movement control: The evidence from Arabic.词长对眼动控制的影响:来自阿拉伯语的证据。
Psychon Bull Rev. 2015 Oct;22(5):1443-50. doi: 10.3758/s13423-015-0809-4.