Kewley-Port Diane, Goodman Shawn S
Department of Speech and Hearing Sciences, Indiana University, Bloomington, Indiana 47405, USA.
J Acoust Soc Am. 2005 Nov;118(5):3252-60. doi: 10.1121/1.2074667.
Formant dynamics in vowel nuclei contribute to vowel classification in English. This study examined listeners' ability to discriminate dynamic second formant transitions in synthetic high front vowels. Acoustic measurements were made from the nuclei (steady state and 20% and 80% of vowel duration) for the vowels /i, ɪ, e, ɛ, æ/ spoken by a female in /bVd/ context. Three synthesis parameters were selected to yield twelve discrimination conditions: initial frequency value for F2 (2525, 2272, or 2068 Hz), slope direction (rising or falling), and duration (110 or 165 ms). F1 frequency was roved. In the standard stimuli, F0 and F1-F4 were steady state. In the comparison stimuli only F2 frequency varied linearly to reach a final frequency. Five listeners were tested under adaptive tracking to estimate the threshold for frequency extent, the minimal detectable difference in frequency between the initial and final F2 values, called ΔF extent. Analysis showed that initial F2 frequency and direction of movement for some F2 frequencies contributed to significant differences in ΔF extent. Results suggested that listeners attended to differences in the stimulus property of frequency extent (hertz), not formant slope (hertz/second). Formant extent thresholds were at least four times smaller than extents measured in the natural speech tokens, and 18 times smaller than for the diphthongized vowel /e/.
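The design described in the abstract (three initial F2 values, two slope directions, two durations) can be enumerated in a short Python sketch, which also illustrates why frequency extent (Hz) and formant slope (Hz/s) are separable stimulus properties: for a linear glide, the same extent gives a different slope at each duration. The function name `f2_glide` and the extent of 66 Hz are hypothetical choices made for illustration only; they are not values or methods reported by the study.

```python
from itertools import product

# Synthesis parameters as listed in the abstract.
INITIAL_F2_HZ = (2525, 2272, 2068)
DIRECTIONS = ("rising", "falling")
DURATIONS_MS = (110, 165)


def f2_glide(initial_hz, direction, duration_ms, extent_hz):
    """Return (final F2 in Hz, slope in Hz/s) for a linear F2 glide.

    extent_hz is the unsigned difference between initial and final F2.
    """
    sign = 1 if direction == "rising" else -1
    final_hz = initial_hz + sign * extent_hz
    slope_hz_per_s = sign * extent_hz / (duration_ms / 1000.0)
    return final_hz, slope_hz_per_s


# 3 initial frequencies x 2 directions x 2 durations = 12 conditions.
conditions = list(product(INITIAL_F2_HZ, DIRECTIONS, DURATIONS_MS))

# Hypothetical extent, NOT a threshold from the study.
EXAMPLE_EXTENT_HZ = 66

for initial_hz, direction, duration_ms in conditions:
    final_hz, slope = f2_glide(initial_hz, direction, duration_ms, EXAMPLE_EXTENT_HZ)
    print(f"{initial_hz} Hz {direction:7s} {duration_ms} ms -> "
          f"final {final_hz} Hz, slope {slope:.0f} Hz/s")
```

With a fixed extent, the glide is steeper in the 110 ms condition than in the 165 ms condition, so thresholds that stay constant in hertz across durations point to extent, rather than slope, as the cue listeners used.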