Suppr超能文献

元音的听觉感知解读

Auditory-perceptual interpretation of the vowel.

作者信息

Miller J D

机构信息

Central Institute for the Deaf, St. Louis, Missouri 63110.

出版信息

J Acoust Soc Am. 1989 May;85(5):2114-34. doi: 10.1121/1.397862.

Abstract

The major issues in relating acoustic waveforms of spoken vowels to perceived vowel categories are presented and discussed in terms of the author's auditory-perceptual theory of phonetic recognition. A brief historical review of formant-ratio theory is presented, as well as an analysis of frequency scales that have been proposed for description of the vowel. It is illustrated that the monophthongal vowel sounds of American English can be represented as clustered in perceptual target zones within a three-dimensional auditory-perceptual space (APS), and it is shown that preliminary versions of these target zones segregate a corpus of vowels of American English with 93% accuracy. Furthermore, it is shown that the nonretroflex vowels of American English fall within a narrow slab within the APS, with spread vowels near the front of this slab and rounded vowels near the back. Retroflex vowels fall in a distinct region behind the vowel slab. Descriptions of the vowels within the APS are shown to be correlated with their descriptions in terms of dimensions of articulation and timbre. Additionally, issues related to talker normalization, coarticulation effects, segmentation, pitch, transposition, and diphthongization are discussed.

摘要

本文根据作者的语音识别听觉-感知理论,介绍并讨论了将口语元音的声学波形与感知到的元音类别相关联的主要问题。文中简要回顾了共振峰比率理论,并分析了为描述元音而提出的频率标度。结果表明,美国英语的单元音可以表示为聚集在三维听觉-感知空间(APS)中的感知目标区域内,并且这些目标区域的初步版本能够以93%的准确率区分美国英语元音语料库。此外,研究表明,美国英语的非卷舌元音落在APS内的一个狭窄平板区域内,其中展唇元音靠近该平板区域的前部,圆唇元音靠近后部。卷舌元音落在元音平板区域后面的一个独特区域。APS内元音的描述与它们在发音和音色维度方面的描述相关。此外,还讨论了与说话者归一化、协同发音效应、分割、音高、换位和双元音化相关的问题。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验