Faculty of English, Neuroscience of Language Laboratory, Adam Mickiewicz University, Poznań, Poland.
Faculty of English, Department of Psycholinguistic Studies, Adam Mickiewicz University, Poznań, Poland.
PLoS One. 2023 Apr 24;18(4):e0284801. doi: 10.1371/journal.pone.0284801. eCollection 2023.
This study presents a Polish semantic priming dataset and semantic similarity ratings for word pairs obtained from native Polish speakers, as well as a range of semantic spaces. The word pairs are strongly related, weakly related, or semantically unrelated. The rating study (Experiment 1) confirmed that the three conditions differed in semantic relatedness. The semantic priming lexical decision study (Experiment 2), run on a carefully matched subset of the stimuli, revealed strong semantic priming effects for strongly related word pairs, whereas weakly related word pairs showed a smaller but still significant priming effect relative to semantically unrelated word pairs. The datasets from both experiments, together with the Polish version of SimLex-999, were then used for robust semantic model selection among existing and newly trained semantic spaces. This database of semantic vectors, semantic relatedness ratings, and behavioral data collected for all word pairs enables future researchers to benchmark new vectors against this dataset. Furthermore, the new vectors are made freely available to researchers. Although comparable sets of strongly and weakly related word pairs are available in other languages, this is the first freely available database for Polish that combines measures of semantic distance and human data.
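As an illustration of what benchmarking new vectors against human relatedness ratings could look like, the following is a minimal sketch and not the authors' procedure. It assumes that model similarity is the cosine between two word vectors and that agreement with the ratings is measured by Spearman rank correlation; the abstract specifies neither. The function names, the toy vectors, and the toy ratings are hypothetical placeholders, not values from the dataset.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def benchmark(vectors, rated_pairs):
    """Correlate model cosine similarities with human ratings.

    Pairs with a word missing from `vectors` are skipped.
    """
    model_scores, human_scores = [], []
    for word1, word2, rating in rated_pairs:
        if word1 in vectors and word2 in vectors:
            model_scores.append(cosine(vectors[word1], vectors[word2]))
            human_scores.append(rating)
    return spearman(model_scores, human_scores)


# Hypothetical toy data, invented for illustration only.
toy_vectors = {
    "kot": [1.0, 0.0],
    "pies": [0.9, 0.1],
    "mysz": [0.5, 0.5],
    "stół": [0.0, 1.0],
}
toy_pairs = [
    ("kot", "pies", 6.0),  # stands in for a strongly related pair
    ("kot", "mysz", 4.0),  # stands in for a weakly related pair
    ("kot", "stół", 1.0),  # stands in for an unrelated pair
]

print(benchmark(toy_vectors, toy_pairs))
```

A rank correlation is used here because rating scales and cosine values live on different, non-linearly related scales; any other agreement measure could be substituted without changing the overall structure.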