元音识别：正字法、感知和声学方面。

Vowel identification: orthographic, perceptual, and acoustic aspects.

作者信息

Assmann P F, Nearey T M, Hogan J T

出版信息

J Acoust Soc Am. 1982 Apr;71(4):975-89. doi: 10.1121/1.387579.

Abstract

This study investigates conditions under which vowels are well recognized and relates perceptual identification of individual tokens to acoustic characteristics. Results support recent finding that isolated vowels may be readily identified by listeners. Two experiments provided evidence that certain response tasks result in inflated error rates. Subsequent experiments showed improved identification in a fixed speaker context, compared with randomized speakers, for isolated vowels and gated centers. Performance was worse for gated vowels, suggesting that dynamic properties (such as duration and diphthongization) supplement steady-state cues. However, even-speaker-randomized gated vowels were well identified (14% errors). Measures of "steady-state information" (formant frequencies and f0), "dynamic information" (formant slopes and duration), and "speaker information" (normalization) were adopted. Discriminant analyses of acoustic measurements indicated relatively little overlap between vowel categories. Using a new technique for relating acoustic measurements of individual tokens with identification by listeners, it is shown that (a) identification performance is clearly related to acoustic characteristics; (b) improvement in the fixed speaker context is correlated with improved statistical separation resulting from formant normalization, for the gated vowels; and (c) "dynamic information" is related to identification differences between full and gated isolated vowels.

摘要

本研究调查了元音被良好识别的条件，并将单个音素的感知识别与声学特征联系起来。结果支持了最近的一项发现，即孤立的元音可能很容易被听众识别。两项实验提供了证据，表明某些反应任务会导致错误率虚高。随后的实验表明，与随机安排说话者相比，在固定说话者的情境中，孤立元音和音门塞音中心的识别有所改善。音门塞音化元音的表现较差，这表明动态特性（如时长和双元音化）补充了稳态线索。然而，即使说话者随机排列的音门塞音化元音也能被很好地识别（错误率为14%）。采用了“稳态信息”（共振峰频率和基频）、“动态信息”（共振峰斜率和时长）以及“说话者信息”（归一化）的测量方法。声学测量的判别分析表明元音类别之间的重叠相对较少。使用一种将单个音素的声学测量与听众识别联系起来的新技术，结果表明：（a）识别性能与声学特征明显相关；（b）对于音门塞音化元音，在固定说话者情境中的改善与共振峰归一化导致的统计分离改善相关；（c）“动态信息”与完整孤立元音和音门塞音化孤立元音之间的识别差异有关。

相似文献

Vowel identification: orthographic, perceptual, and acoustic aspects.

J Acoust Soc Am. 1982 Apr;71(4):975-89. doi: 10.1121/1.387579.

Dynamic specification of coarticulated German vowels: perceptual and acoustical studies.

J Acoust Soc Am. 1998 Jul;104(1):488-504. doi: 10.1121/1.423299.

Static, dynamic, and relational properties in vowel perception.

J Acoust Soc Am. 1989 May;85(5):2088-113. doi: 10.1121/1.397861.

Perceptual separation of simultaneous vowels: within and across-formant grouping by F0.

J Acoust Soc Am. 1993 Jun;93(6):3454-67. doi: 10.1121/1.405675.

The relative contributions of speaking fundamental frequency and formant frequencies to gender identification based on isolated vowels.

J Voice. 2005 Dec;19(4):544-54. doi: 10.1016/j.jvoice.2004.10.006.

Perception of vowels and prosody by cochlear implant recipients in noise.

J Commun Disord. 2013 Sep-Dec;46(5-6):449-64. doi: 10.1016/j.jcomdis.2013.09.002. Epub 2013 Sep 21.

On the sufficiency of compound target specification of isolated vowels and vowels in /bVb/ syllables.

J Acoust Soc Am. 1992 Jan;91(1):390-410. doi: 10.1121/1.402781.

Acoustic analysis of vowel formant frequencies in genetically-related and non-genetically related speakers with implications for forensic speaker comparison.

PLoS One. 2021 Feb 18;16(2):e0246645. doi: 10.1371/journal.pone.0246645. eCollection 2021.

Identification of coarticulated vowels.

J Acoust Soc Am. 1980 Dec;68(6):1626-35. doi: 10.1121/1.385218.

Dynamic information in the identification and discrimination of vowels.

Phonetica. 1989;46(1-3):97-116. doi: 10.1159/000261831.

引用本文的文献

Evaluating normalization accounts against the dense vowel space of Central Swedish.

Front Psychol. 2023 Jun 21;14:1165742. doi: 10.3389/fpsyg.2023.1165742. eCollection 2023.

Talker adaptation or "talker" adaptation? Musical instrument variability impedes pitch perception.

Atten Percept Psychophys. 2023 Oct;85(7):2488-2501. doi: 10.3758/s13414-023-02722-4. Epub 2023 May 31.

Clearly, fame isn't everything: Talker familiarity does not augment talker adaptation.

Atten Percept Psychophys. 2024 Apr;86(3):962-975. doi: 10.3758/s13414-022-02615-y. Epub 2022 Nov 23.

A New Proposal for Phoneme Acquisition: Computing Speaker-Specific Distribution.

Brain Sci. 2021 Feb 1;11(2):177. doi: 10.3390/brainsci11020177.

Acoustic-phonetic and auditory mechanisms of adaptation in the perception of sibilant fricatives.

Atten Percept Psychophys. 2020 May;82(4):2027-2048. doi: 10.3758/s13414-019-01894-2.

Noninvasive neurostimulation of left temporal lobe disrupts rapid talker adaptation in speech processing.

Brain Lang. 2019 Sep;196:104655. doi: 10.1016/j.bandl.2019.104655. Epub 2019 Jul 13.

Time and information in perceptual adaptation to speech.

Cognition. 2019 Nov;192:103982. doi: 10.1016/j.cognition.2019.05.019. Epub 2019 Jun 21.

Varying acoustic-phonemic ambiguity reveals that talker normalization is obligatory in speech processing.

Atten Percept Psychophys. 2018 Apr;80(3):784-797. doi: 10.3758/s13414-017-1395-5.

Speaker and Accent Variation Are Handled Differently: Evidence in Native and Non-Native Listeners.

PLoS One. 2016 Jun 16;11(6):e0156870. doi: 10.1371/journal.pone.0156870. eCollection 2016.

A general auditory bias for handling speaker variability in speech? Evidence in humans and songbirds.

Front Psychol. 2015 Aug 25;6:1243. doi: 10.3389/fpsyg.2015.01243. eCollection 2015.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

元音识别：正字法、感知和声学方面。

Vowel identification: orthographic, perceptual, and acoustic aspects.

作者信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献