Department of Linguistics, Cornell University.
Department of Linguistics and Center for Data Science, New York University.
Cogn Sci. 2021 Jun;45(6):e12988. doi: 10.1111/cogs.12988.
The disambiguation of a syntactically ambiguous sentence in favor of a less preferred parse can lead to slower reading at the disambiguation point. This phenomenon, referred to as a garden-path effect, has motivated models in which readers initially maintain only a subset of the possible parses of the sentence, and subsequently require time-consuming reanalysis to reconstruct a discarded parse. A more recent proposal argues that the garden-path effect can be reduced to surprisal arising in a fully parallel parser: words consistent with the initially dispreferred but ultimately correct parse are simply less predictable than those consistent with the incorrect parse. Since predictability has pervasive effects in reading far beyond garden-path sentences, this account, which dispenses with reanalysis mechanisms, is more parsimonious. Crucially, it predicts a linear effect of surprisal: the garden-path effect is expected to be proportional to the difference in word surprisal between the ultimately correct and ultimately incorrect interpretations. To test this prediction, we used recurrent neural network language models to estimate word-by-word surprisal for three temporarily ambiguous constructions. We then estimated the slowdown attributed to each bit of surprisal from human self-paced reading times, and used that quantity to predict syntactic disambiguation difficulty. Surprisal successfully predicted the existence of garden-path effects, but drastically underpredicted their magnitude, and failed to predict their relative severity across constructions. We conclude that a full explanation of syntactic disambiguation difficulty may require recovery mechanisms beyond predictability.
一个句法上有歧义的句子向不太受欢迎的解析的消歧,可能会导致在消歧点的阅读速度变慢。这种现象被称为花园路径效应,它促使人们提出了这样的模型:读者最初只保留句子可能解析的一个子集,然后需要耗时的重新分析来重构被丢弃的解析。最近的一个提议认为,花园路径效应可以归结为完全并行解析器中产生的惊讶度:与最初不喜欢但最终正确的解析一致的词,其可预测性比与错误解析一致的词要低。由于可预测性在阅读中具有广泛的影响,远远超出了花园路径句子的范围,因此这种不需要重新分析机制的解释更加简洁。至关重要的是,它预测了惊讶度的线性效应:花园路径效应预计与最终正确和最终错误解释之间的单词惊讶度差异成正比。为了验证这一预测,我们使用递归神经网络语言模型来估计三个暂时有歧义的结构的逐字惊讶度。然后,我们从人类的自我调整阅读时间中估计每个惊讶度比特的减速,并使用该数量来预测句法歧义消解的难度。惊讶度成功地预测了花园路径效应的存在,但大大低估了其幅度,并且未能预测不同结构之间的相对严重程度。我们得出结论,对句法歧义消解难度的全面解释可能需要超出可预测性的恢复机制。