
Identification of steady-state vowels synthesized from the Peterson and Barney measurements.

Author information

Hillenbrand J, Gayvert R T

Affiliation

Department of Speech Pathology and Audiology, Western Michigan University, Kalamazoo 49008-3825.

Publication information

J Acoust Soc Am. 1993 Aug;94(2 Pt 1):668-74. doi: 10.1121/1.406884.

Abstract

The purpose of this study was to determine how well listeners can identify vowels based exclusively on static spectral cues. Listeners were asked to identify steady-state versions of 1520 vowels (76 talkers × 10 vowels × 2 repetitions) synthesized from Peterson and Barney's measured values of F0 and F1-F3 [J. Acoust. Soc. Am. 24, 175-184 (1952)]. The values for all control parameters remained constant throughout the 300-ms duration of each stimulus. A second set of 1520 signals was identical to these stimuli except that a falling pitch contour was used. The identification error rate for the flat-formant, flat-pitch signals was 27.3%, several times greater than the 5.6% error rate shown by Peterson and Barney's listeners. The introduction of a falling pitch contour resulted in a small but statistically reliable reduction in the error rate. The implications of these results for interpreting pattern recognition studies using the Peterson and Barney database are discussed. Results are also discussed in relation to the role of dynamic cues in vowel identification.

