Centro de Investigación Nebrija en Cognición (CINC), Universidad Nebrija, Madrid, Spain.
UiT The Arctic University of Norway, Tromsø, Norway.
Sci Data. 2022 Jul 21;9(1):431. doi: 10.1038/s41597-022-01552-7.
The growing interdisciplinary field of psycholinguistics is in constant need of new, up-to-date tools that allow researchers to answer complex questions and to extend their work to languages other than English, which dominates the field. One such type of tool is the picture dataset, which provides naming norms for everyday objects. However, existing databases tend to include a small number of items and have been normed in only a limited number of languages, despite the recent boom in multilingualism research. In this paper we present the Multilingual Picture (Multipic) database, which contains naming norms and familiarity scores for 500 coloured pictures in thirty-two languages or language varieties from around the world. The data were validated with standard methods that have been used for existing picture datasets. This is the first dataset to provide naming norms and translation equivalents for such a variety of languages; as such, it will be of particular value to psycholinguists and other interested researchers. The dataset has been made freely available.