Department of Psychology, University of Milano - Bicocca, Piazza dell'Ateneo Nuovo 1, Milano, MI 20126, Italy.
Behav Res Methods. 2024 Aug;56(5):5190-5213. doi: 10.3758/s13428-023-02261-8. Epub 2023 Oct 25.
We release a database of cloze probability values, predictability ratings, and computational estimates for a sample of 205 English sentences (1726 words), aligned with previously released word-by-word reading time data (both self-paced reading and eye-movement records; Frank et al., Behavior Research Methods, 45(4), 1182-1190. 2013) and EEG responses (Frank et al., Brain and Language, 140, 1-11. 2015). Our analyses show that predictability ratings are the best predictors of the EEG signal (N400, P600, LAN) self-paced reading times, and eye movement patterns, when spillover effects are taken into account. The computational estimates are particularly effective at explaining variance in the eye-tracking data without spillover. Cloze probability estimates have decent overall psychometric accuracy and are the best predictors of early fixation patterns (first fixation duration). Our results indicate that the choice of the best measurement of word predictability in context critically depends on the processing index being considered.
我们发布了一个 cloze 概率值、可预测性评分和计算估计的数据库,其中包含 205 个英语句子(1726 个单词)的样本,与之前发布的逐字阅读时间数据(包括自我调节阅读和眼动记录;Frank 等人,行为研究方法,45(4),1182-1190. 2013)和 EEG 反应(Frank 等人,大脑与语言,140,1-11. 2015)对齐。我们的分析表明,当考虑溢出效应时,可预测性评分是 EEG 信号(N400、P600、LAN)自我调节阅读时间和眼动模式的最佳预测指标。计算估计在没有溢出的情况下特别有效地解释眼动跟踪数据的方差。 cloze 概率估计具有相当的整体心理测量准确性,是早期注视模式(首次注视持续时间)的最佳预测指标。我们的结果表明,在考虑到语境中的单词可预测性的最佳测量时,选择最佳测量方法取决于正在考虑的处理指标。