Department of Psychology, University of Alberta, Edmonton, Alberta, Canada.
Department of Mechanical Engineering, University of Alberta, Edmonton, Alberta, Canada.
Behav Res Methods. 2021 Oct;53(5):2214-2225. doi: 10.3758/s13428-021-01558-w. Epub 2021 Apr 1.
In this paper our goal is to undertake a systematic assessment of the first, most widely known, and simplest computational model of metaphor comprehension, the predication model developed by Kintsch (Cognitive Science, 25(2), 173-202, 2000). 622 metaphors of the form "x is a y" were selected from a much larger set generated randomly. The metaphors were judged for quality using best/worst judgments, which asks judges to pick the best and worst metaphor from among four presented metaphors. The metaphors and their judgments have been publicly released. We modeled the judgments by extending Kintsch's predication model (2000) by systematically walking through the parameter space of that model. Our model successfully differentiated metaphors rated as good (> 1.5z) from metaphors rated as bad (< -1.5z; Cohen's d = 0.72) and was able to successfully classify good metaphors with an accuracy of 82.9%. However, it achieved a true negative rate below chance at 36.3% and had a resultantly low kappa of 0.037. The model could not distinguish unselected random metaphors from those selected by humans as having metaphorical potential. In a follow-up study we showed that the model's quality estimates reliably predict metaphor decision times, with better metaphors being judged more quickly than worse metaphors.
本文的目的是对隐喻理解的第一个、最广为人知且最简单的计算模型——Kintsch(认知科学,25(2),173-202,2000)提出的命题模型——进行系统评估。我们从一组更大的随机生成的集合中选择了 622 个形式为“x 是 y”的隐喻。使用最佳/最差判断来判断隐喻的质量,这要求评判者从四个呈现的隐喻中选择最好和最差的隐喻。这些隐喻及其判断已经公开。我们通过系统地遍历该模型的参数空间,扩展了 Kintsch 的命题模型(2000 年)来对这些判断进行建模。我们的模型成功地区分了评分高于 1.5z 的好隐喻和评分低于-1.5z 的差隐喻(Cohen 的 d=0.72),并能够以 82.9%的准确率成功分类好隐喻。然而,它的真阴性率低于 36.3%,kappa 值仅为 0.037。该模型无法区分人类选择的具有隐喻潜力的隐喻和未选中的随机隐喻。在后续研究中,我们表明模型的质量估计可靠地预测了隐喻决策时间,更好的隐喻比更差的隐喻被判断得更快。