Laboratoire Psychologie et Neurocognition, CNRS UMR 5105, University Savoie Mont Blanc (USMB), Chambéry, France.
Cognitive Neuroimaging Unit, INSERM, CEA, CNRS, Université Paris-Saclay, NeuroSpin Center, Gif/Yvette, France.
Q J Exp Psychol (Hove). 2024 Feb;77(2):278-286. doi: 10.1177/17470218231164373. Epub 2023 Apr 18.
Pseudowords are letter strings that look like words but are not words. They are used in psycholinguistic research, particularly in tasks such as lexical decision. In this context, it is essential that the pseudowords respect the orthographic statistics of the target language. Pseudowords that violate these statistics would be too easy to reject in a lexical decision task and would not force participants to rely on word recognition when responding to real words. We propose a new pseudoword generator, UniPseudo, which uses an algorithm based on Markov chains of orthographic n-grams. It generates pseudowords from a customizable database, which allows one to control the characteristics of the items. It can produce pseudowords in any language, in orthographic or phonological form. Pseudowords can be generated with specific characteristics, such as frequency of letters, bigrams, trigrams, or quadrigrams, number of syllables, frequency of biphones, and number of morphemes. Thus, from a list of words composed of verbs, nouns, adjectives, or adverbs, UniPseudo can create pseudowords resembling verbs, nouns, adjectives, or adverbs in any language that uses an alphabetic or syllabic writing system.
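To make the idea of a Markov chain over orthographic n-grams concrete, here is a minimal sketch in Python. It is an illustration of the general technique only, not UniPseudo's actual implementation: the function names (`build_chain`, `generate`), the boundary markers, the toy word list, and the choice of a two-character context (that is, trigram transitions) are all assumptions made for this example.

```python
import random
from collections import defaultdict

# Hypothetical boundary markers; they must not occur in the word list.
START, END = "^", "$"

# Toy source list (an assumption for the demo). In practice this would be
# the user's own database, e.g. only nouns of a given length.
WORDS = ["table", "cable", "stable", "fable", "tablet", "cabin", "stain",
         "train", "brain", "grain", "plain", "plane", "crane", "trace",
         "brace", "grace", "place"]

def build_chain(words, order=2):
    """Map each context of `order` characters to the characters that
    follow it in the source words (duplicates kept, so sampling is
    proportional to n-gram frequency)."""
    chain = defaultdict(list)
    for word in words:
        padded = START * order + word + END
        for i in range(len(padded) - order):
            chain[padded[i:i + order]].append(padded[i + order])
    return chain

def generate(chain, lexicon, length, order=2, rng=None, max_tries=10_000):
    """Sample one string of exactly `length` characters that is not in
    `lexicon`. Returns None if no such string is found in `max_tries`."""
    rng = rng or random.Random()
    start = START * order
    if start not in chain:          # empty source list: nothing to sample
        return None
    for _ in range(max_tries):
        context, letters = start, []
        # Allow one extra character so that overlong strings are detected.
        while len(letters) <= length:
            nxt = rng.choice(chain[context])
            if nxt == END:
                break
            letters.append(nxt)
            context = (context + nxt)[-order:]
        candidate = "".join(letters)
        # Exactly `length` letters means the walk stopped on END at the
        # right position; also reject anything that is a real source word.
        if len(candidate) == length and candidate not in lexicon:
            return candidate
    return None

chain = build_chain(WORDS, order=2)
print(generate(chain, set(WORDS), length=5, order=2))
```

Because every transition is drawn from n-grams observed in the source list, restricting that list (for example, to adjectives only) carries its orthographic regularities over to the output, which is the kind of control the abstract describes. A real generator would also need to check candidates against a full dictionary of the language rather than only the source list.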