Humboldt-Universität zu Berlin, Berlin, Germany.
University of Milano-Bicocca, Milan, Italy; NeuroMI, Milan Center for Neuroscience, Milan, Italy.
Cogn Psychol. 2022 May;134:101471. doi: 10.1016/j.cogpsych.2022.101471. Epub 2022 Mar 24.
While distributional semantic models that represent word meanings as high-dimensional vectors induced from large text corpora have been shown to successfully predict human behavior across a wide range of tasks, they have also received criticism from different directions. These include concerns over their interpretability (how can numbers specifying abstract, latent dimensions represent meaning?) and their ability to capture variation in meaning (how can a single vector representation capture multiple different interpretations for the same expression?). Here, we demonstrate that semantic vectors can indeed rise up to these challenges, by training a mapping system (a simple linear regression) that predicts inter-individual variation in relational interpretations for compounds such as wood brush (for example brush FOR wood, or brush MADE OF wood) from (compositional) semantic vectors representing the meanings of these compounds. These predictions consistently beat different random baselines, both for familiar compounds (moon light, Experiment 1) as well as novel compounds (wood brush, Experiment 2), demonstrating that distributional semantic vectors encode variations in qualitative interpretations that can be decoded using techniques as simple as linear regression.
虽然基于大型文本语料库的高维向量来表示单词含义的分布语义模型已被证明可以成功地预测各种任务中的人类行为,但它们也受到了来自不同方向的批评。这些批评包括对其可解释性的担忧(如何用指定抽象、潜在维度的数字来表示意义?)和其捕捉意义变化的能力(如何用单个向量表示来捕捉同一个表达式的多个不同解释?)。在这里,我们通过训练一个映射系统(简单的线性回归)来证明语义向量确实可以应对这些挑战,该系统可以从表示这些化合物含义的(组合的)语义向量中预测化合物(例如 wood brush,例如 brush FOR wood 或 brush MADE OF wood)的关系解释在个体间的变化。这些预测在熟悉的化合物(moon light,实验 1)和新颖的化合物(wood brush,实验 2)上都明显优于不同的随机基线,证明了分布语义向量编码了可以使用像线性回归这样简单的技术来解码的定性解释的变化。