Persson Anna, Jaeger T Florian
Department of Swedish Language and Multilingualism, Stockholm University, Stockholm, Sweden.
Brain and Cognitive Sciences, University of Rochester, Rochester, NY, United States.
Front Psychol. 2023 Jun 21;14:1165742. doi: 10.3389/fpsyg.2023.1165742. eCollection 2023.
Talkers vary in the phonetic realization of their vowels. One influential hypothesis holds that listeners overcome this inter-talker variability through pre-linguistic auditory mechanisms that normalize the acoustic or phonetic cues that form the input to speech recognition. Dozens of competing normalization accounts exist, including both accounts specific to vowel perception and general-purpose accounts that can be applied to any type of cue. We add to the cross-linguistic literature on this matter by comparing normalization accounts against a new phonetically annotated vowel database of Swedish, a language with a particularly dense vowel inventory of 21 vowels differing in quality and quantity. We evaluate normalization accounts on how they differ in predicted consequences for perception. The results indicate that the best-performing accounts either center or standardize formants by talker. The study also suggests that general-purpose accounts perform as well as vowel-specific accounts, and that vowel normalization operates in both temporal and spectral domains.
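To make the distinction between centering and standardizing formants by talker concrete, the following is a minimal Python sketch and not the authors' implementation. Centering subtracts each talker's mean formant value; standardizing additionally divides by that talker's standard deviation, as in Lobanov-style z-scoring. The function name, the tuple layout, the use of raw Hz, the population standard deviation, and the toy formant values are all assumptions made for illustration; none of them come from the study.

```python
from collections import defaultdict
from statistics import mean, pstdev


def normalize_by_talker(tokens, standardize=False):
    """Center (and optionally standardize) F1 and F2 within each talker.

    tokens: list of (talker, f1_hz, f2_hz) tuples (hypothetical layout).
    Returns a list of (talker, f1_norm, f2_norm) in the input order.
    """
    # Group formant values by talker.
    by_talker = defaultdict(list)
    for talker, f1, f2 in tokens:
        by_talker[talker].append((f1, f2))

    # Per-talker mean and (population) standard deviation for each formant.
    stats = {}
    for talker, values in by_talker.items():
        f1s, f2s = zip(*values)
        stats[talker] = (mean(f1s), pstdev(f1s), mean(f2s), pstdev(f2s))

    normalized = []
    for talker, f1, f2 in tokens:
        m1, s1, m2, s2 = stats[talker]
        n1, n2 = f1 - m1, f2 - m2  # centering
        if standardize:
            if s1 == 0 or s2 == 0:
                raise ValueError(f"no formant variation for talker {talker!r}")
            n1, n2 = n1 / s1, n2 / s2  # z-scoring
        normalized.append((talker, n1, n2))
    return normalized


# Invented toy values in Hz, two vowel tokens per talker.
tokens = [
    ("A", 300.0, 2300.0),
    ("A", 700.0, 1300.0),
    ("B", 400.0, 2800.0),
    ("B", 900.0, 1600.0),
]
centered = normalize_by_talker(tokens)
standardized = normalize_by_talker(tokens, standardize=True)
```

In the toy data, talker B has higher raw formants than talker A for both tokens. After centering, each talker's values are expressed relative to that talker's own mean, and after standardizing they are also on a common scale, so tokens from different talkers become directly comparable.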