Université Catholique de Louvain, Louvain-la-Neuve, Belgium.
Behav Res Methods. 2012 Dec;44(4):998-1006. doi: 10.3758/s13428-012-0195-z.
In psychology, lexical norms related to the semantic properties of words, such as concreteness and valence, are important research resources. Collecting such norms by asking judges to rate the words is very time consuming, which strongly limits the number of words that compose them. In the present article, we present a technique for estimating lexical norms based on the latent semantic analysis of a corpus. The analyses conducted emphasize the technique's effectiveness for several semantic dimensions. In addition to the extension of norms, this technique can be used to check human ratings to identify words for which the rating is very different from the corpus-based estimate.
在心理学中,与词汇的语义属性(如具体性和情感性)相关的词汇规范是重要的研究资源。通过让评判者对词汇进行评分来收集这些规范非常耗时,这极大地限制了组成词汇规范的词汇数量。在本文中,我们提出了一种基于语料库潜在语义分析的词汇规范估计技术。所进行的分析强调了该技术在多个语义维度上的有效性。除了扩展规范外,该技术还可用于检查人类评分,以识别那些评分与基于语料库的估计值差异很大的词汇。