Center for Language and Speech Processing, Johns Hopkins University, Baltimore, Maryland 21218, USA.
J Acoust Soc Am. 2012 Oct;132(4):2663-75. doi: 10.1121/1.4747008.
In a previous study on plosives, the 3-Dimensional Deep Search (3DDS) method for the exploration of the necessary and sufficient cues for speech perception was introduced (Li et al., (2010). J. Acoust. Soc. Am. 127(4), 2599-2610). Here, this method is used to isolate the spectral cue regions for perception of the American English fricatives /∫, 3, s, z, f, v, θ, δ in time, frequency, and intensity. The fricatives are analyzed in the context of consonant-vowel utterances, using the vowel /α/. The necessary cues were found to be contained in the frication noise for /∫, 3, s, z, f, v/. 3DDS analysis isolated the cue regions of /s, z/ between 3.6 and 8 [kHz] and /∫, 3/ between 1.4 and 4.2 [kHz]. Some utterances were found to contain acoustic components that were unnecessary for correct perception, but caused listeners to hear non-target consonants when the primary cue region was removed; such acoustic components are labeled "conflicting cue regions." The amplitude modulation of the high-frequency frication region by the fundamental F0 was found to be a sufficient cue for voicing. Overall, the 3DDS method allows one to analyze the effects of natural speech components without initial assumptions about where perceptual cues lie in time-frequency space or which elements of production they correspond to.
在之前关于爆破音的研究中,介绍了 3DDS(三维深度搜索)方法,用于探索语音感知所需的充分线索(Li 等人,(2010)。J. Acoust. Soc. Am. 127(4),2599-2610)。在这里,该方法用于在时间、频率和强度上隔离感知美语摩擦音 /∫、3、s、z、f、v、θ、δ 的谱线索区域。摩擦音在辅音-元音发音中进行分析,使用元音 /α/。研究发现,/∫、3、s、z、f、v/ 的摩擦噪声中包含了必要的线索。3DDS 分析分离了 /s、z/ 之间的线索区域在 3.6 和 8 [kHz] 之间,/∫、3/ 在 1.4 和 4.2 [kHz] 之间。发现一些发音包含了对正确感知不必要的声学成分,但当主要线索区域被移除时,会导致听众听到非目标辅音;这些声学成分被标记为“冲突线索区域”。高频摩擦区域的基频 F0 调幅被发现是浊音的充分线索。总的来说,3DDS 方法允许人们在不预先假设感知线索在时频空间中的位置或它们对应于产生的哪些元素的情况下,分析自然语音成分的影响。