Assmann P F, Nearey T M, Hogan J T
J Acoust Soc Am. 1982 Apr;71(4):975-89. doi: 10.1121/1.387579.
This study investigates conditions under which vowels are well recognized and relates perceptual identification of individual tokens to acoustic characteristics. Results support recent findings that isolated vowels may be readily identified by listeners. Two experiments provided evidence that certain response tasks result in inflated error rates. Subsequent experiments showed improved identification in a fixed-speaker context, compared with randomized speakers, for isolated vowels and gated centers. Performance was worse for gated vowels, suggesting that dynamic properties (such as duration and diphthongization) supplement steady-state cues. However, even speaker-randomized gated vowels were well identified (14% errors). Measures of "steady-state information" (formant frequencies and f0), "dynamic information" (formant slopes and duration), and "speaker information" (normalization) were adopted. Discriminant analyses of acoustic measurements indicated relatively little overlap between vowel categories. Using a new technique for relating acoustic measurements of individual tokens to identification by listeners, it is shown that (a) identification performance is clearly related to acoustic characteristics; (b) for the gated vowels, improvement in the fixed-speaker context is correlated with improved statistical separation resulting from formant normalization; and (c) "dynamic information" is related to identification differences between full and gated isolated vowels.
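The abstract does not say which normalization procedure was used, so the following is only an illustrative sketch of one common approach to formant normalization: subtracting each speaker's mean log formant value from that speaker's log formants. The function name `log_mean_normalize`, the token layout, and all formant values are hypothetical and synthetic; they are not measurements from the study.

```python
import math
from collections import defaultdict


def log_mean_normalize(tokens):
    """Subtract each speaker's mean log-formant value from that speaker's tokens.

    tokens: list of dicts with keys 'speaker', 'vowel', 'formants' (Hz).
    Returns a list of (speaker, vowel, normalized_formants) tuples.
    This is an assumed procedure, not necessarily the one used in the paper.
    """
    logs_by_speaker = defaultdict(list)
    for t in tokens:
        logs_by_speaker[t["speaker"]].extend(math.log(f) for f in t["formants"])

    # One reference value per speaker: the mean over all log formants.
    speaker_mean = {s: sum(v) / len(v) for s, v in logs_by_speaker.items()}

    return [
        (
            t["speaker"],
            t["vowel"],
            tuple(math.log(f) - speaker_mean[t["speaker"]] for f in t["formants"]),
        )
        for t in tokens
    ]


# Synthetic (F1, F2) values in Hz for illustration only.
base = {"i": (270.0, 2290.0), "a": (730.0, 1090.0), "u": (300.0, 870.0)}
scale = {"A": 1.0, "B": 1.2}  # speaker B: all formants uniformly 20% higher

tokens = [
    {"speaker": s, "vowel": v, "formants": tuple(k * f for f in fs)}
    for s, k in scale.items()
    for v, fs in base.items()
]

normalized = log_mean_normalize(tokens)
for speaker, vowel, values in normalized:
    print(speaker, vowel, [round(x, 3) for x in values])
```

In this toy case the two speakers differ only by a uniform scale factor, so after normalization each vowel has the same coordinates for both speakers, up to floating-point rounding. Real speakers differ in less regular ways, and normalization would then reduce, not eliminate, between-speaker overlap.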