Faculty of Psychology and Educational Sciences, University of Leuven, Tiensestraat 102, 3000, Leuven, Belgium.
Behav Res Methods. 2013 Jun;45(2):480-98. doi: 10.3758/s13428-012-0260-7.
In this article, we describe the most extensive set of word associations collected to date. The database contains over 12,000 cue words for which more than 70,000 participants generated three responses in a multiple-response free association task. The goal of this study was (1) to create a semantic network that covers a large part of the human lexicon, (2) to investigate the implications of a multiple-response procedure by deriving a weighted directed network, and (3) to show how measures of centrality and relatedness derived from this network predict both lexical access in a lexical decision task and semantic relatedness in similarity judgment tasks. First, our results show that the multiple-response procedure results in a more heterogeneous set of responses, which lead to better predictions of lexical access and semantic relatedness than do single-response procedures. Second, the directed nature of the network leads to a decomposition of centrality that primarily depends on the number of incoming links or in-degree of each node, rather than its set size or number of outgoing links. Both studies indicate that adequate representation formats and sufficiently rich data derived from word associations represent a valuable type of information in both lexical and semantic processing.
在本文中,我们描述了迄今为止收集到的最广泛的词汇联想集。该数据库包含超过 12000 个提示词,超过 70000 名参与者在多项自由联想任务中对每个提示词生成了三个反应。本研究的目的是:(1)创建一个覆盖人类词汇大部分的语义网络;(2)通过得出一个加权有向网络,研究多项反应程序的含义;(3)展示从这个网络中得出的中心性和关联性度量如何预测词汇判断任务中的词汇访问和相似性判断任务中的语义关联性。首先,我们的结果表明,多项反应程序会产生更多样化的反应,这些反应比单项反应程序能更好地预测词汇访问和语义关联性。其次,网络的有向性质导致中心性的分解主要取决于每个节点的入站链接数量或入度,而不是其集合大小或出站链接数量。这两项研究都表明,充分的表示格式和从词汇联想中提取的丰富数据在词汇和语义处理中代表了一种有价值的信息类型。