Stilp Christian E, Assgari Ashley A
Department of Psychological and Brain Sciences, University of Louisville, 317 Life Sciences Building, Louisville, KY, 40292, USA.
Atten Percept Psychophys. 2018 Jul;80(5):1300-1310. doi: 10.3758/s13414-018-1488-9.
Speech perception is heavily influenced by surrounding sounds. When spectral properties differ between earlier (context) and later (target) sounds, this can produce spectral contrast effects (SCEs) that bias perception of later sounds. For example, when context sounds have more energy in low-F frequency regions, listeners report more high-F responses to a target vowel, and vice versa. SCEs have been reported using various approaches for a wide range of stimuli, but most often, large spectral peaks were added to the context to bias speech categorization. This obscures the lower limit of perceptual sensitivity to spectral properties of earlier sounds, i.e., when SCEs begin to bias speech categorization. Listeners categorized vowels (/ɪ/-/ɛ/, Experiment 1) or consonants (/d/-/g/, Experiment 2) following a context sentence with little spectral amplification (+1 to +4 dB) in frequency regions known to produce SCEs. In both experiments, +3 and +4 dB amplification in key frequency regions of the context produced SCEs, but lesser amplification was insufficient to bias performance. This establishes a lower limit of perceptual sensitivity where spectral differences across sounds can bias subsequent speech categorization. These results are consistent with proposed adaptation-based mechanisms that potentially underlie SCEs in auditory perception.
Recent sounds can change what speech sounds we hear later. This can occur when the average frequency composition of earlier sounds differs from that of later sounds, biasing how they are perceived. These "spectral contrast effects" are widely observed when sounds' frequency compositions differ substantially. We reveal the lower limit of these effects, as +3 dB amplification of key frequency regions in earlier sounds was enough to bias categorization of the following vowel or consonant sound. Speech categorization being biased by very small spectral differences across sounds suggests that spectral contrast effects occur frequently in everyday speech perception.
言语感知会受到周围声音的严重影响。当前期(背景)声音和后期(目标)声音的频谱特性不同时,这会产生频谱对比效应(SCEs),从而使对后期声音的感知产生偏差。例如,当背景声音在低频区域具有更多能量时,听众对目标元音的高频响应会更多,反之亦然。已经使用各种方法针对广泛的刺激报告了频谱对比效应,但最常见的是,在背景中添加大的频谱峰值以偏向言语分类。这掩盖了对前期声音频谱特性的感知敏感性下限,即频谱对比效应何时开始使言语分类产生偏差。听众在一个背景句子之后对元音(/ɪ/-/ɛ/,实验1)或辅音(/d/-/g/,实验2)进行分类,在已知会产生频谱对比效应的频率区域中频谱放大很小(+1至+4分贝)。在两个实验中,背景关键频率区域中+3和+4分贝的放大产生了频谱对比效应,但较小的放大不足以使表现产生偏差。这确定了感知敏感性的下限,即声音之间的频谱差异可以使后续言语分类产生偏差的下限。这些结果与所提出的基于适应的机制一致,这些机制可能是听觉感知中频谱对比效应的基础。
近期听到的声音可以改变我们随后听到的语音。当前期声音的平均频率组成与后期声音不同时,就会发生这种情况,从而使它们的感知方式产生偏差。当声音的频率组成有很大差异时,这些“频谱对比效应”被广泛观察到。我们揭示了这些效应的下限,因为前期声音中关键频率区域的+3分贝放大足以使后续元音或辅音声音的分类产生偏差。语音分类因声音之间非常小的频谱差异而产生偏差表明,频谱对比效应在日常言语感知中经常发生。