Blumstein S E, Stevens K N
J Acoust Soc Am. 1980 Feb;67(2):648-62. doi: 10.1121/1.383890.
A series of listening tests with brief synthetic consonant-vowel syllables was carried out to determine whether the initial part of a syllable can provide cues to place of articulation for voiced stop consonants independent of the remainder of the syllable. The data show that stimuli as short as 10-20 ms sampled from the onset of a consonant-vowel syllable, can be reliably identified for consonantal place of articulation, whether the second and higher formants contain moving or straight transitions and whether or not an initial burst is present. In most instances, these brief stimuli also contain sufficient information for vowel indentification. Stimulus continua in which formant transitions ranged from values appropriate to [b], [d], [g] in various vowel environments, and in which stimulus durations were 20 and 46 ms, yielded categorical labeling functions with a few exceptions. These results are consistent with a theory of speech perception in which consonant place of articulation is cued by invariant properties derived from the spectrum sampled in a 10-20 ms time window adjacent to consonantal onset or offset.
进行了一系列使用简短合成辅音-元音音节的听力测试,以确定音节的起始部分是否能够独立于音节的其余部分,为浊塞音的发音部位提供线索。数据表明,从辅音-元音音节的起始处采样的短至10 - 20毫秒的刺激,无论第二和更高共振峰包含移动还是直线过渡,以及是否存在初始爆破音,都能够可靠地识别辅音的发音部位。在大多数情况下,这些简短的刺激也包含足够的信息用于元音识别。共振峰过渡范围从各种元音环境中适合[b]、[d]、[g]的值,且刺激持续时间为20和46毫秒的刺激连续体,除了少数例外情况,产生了类别标记函数。这些结果与一种语音感知理论一致,在该理论中,辅音的发音部位由在与辅音起始或偏移相邻的10 - 20毫秒时间窗口内采样的频谱所派生的不变属性来提示。