The role of vowel and consonant fundamental frequency, envelope, and temporal fine structure cues to the intelligibility of words and sentences.

Affiliation

Department of Speech and Hearing Sciences, Indiana University, Bloomington, Indiana 47405, USA.

Publication information

J Acoust Soc Am. 2012 Feb;131(2):1490-501. doi: 10.1121/1.3676696.

Abstract

The speech signal contains many acoustic properties that may contribute differently to spoken word recognition. Previous studies have demonstrated that the importance of properties present during consonants or vowels is dependent upon the linguistic context (i.e., words versus sentences). The current study investigated three potentially informative acoustic properties that are present during consonants and vowels for monosyllabic words and sentences. Natural variations in fundamental frequency were either flattened or removed. The speech envelope and temporal fine structure were also investigated by limiting the availability of these cues via noisy signal extraction. Thus, this study investigated the contribution of these acoustic properties, present during either consonants or vowels, to overall word and sentence intelligibility. Results demonstrated that all processing conditions displayed better performance for vowel-only sentences. Greater performance with vowel-only sentences remained, despite removing dynamic cues of the fundamental frequency. Word and sentence comparisons suggest that the speech envelope may be at least partially responsible for additional vowel contributions in sentences. Results suggest that speech information transmitted by the envelope is responsible, in part, for greater vowel contributions in sentences, but is not predictive for isolated words.
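
For readers unfamiliar with the terms, the envelope and temporal fine structure of a band-limited signal are commonly defined through the analytic signal: the envelope is its magnitude and the fine structure is the cosine of its phase. The sketch below illustrates only that definition on a single band. It is not the processing used in the study, which the abstract describes only as noisy signal extraction, and the function names are hypothetical. It uses a slow textbook DFT so that it needs nothing beyond the Python standard library.

```python
import cmath
import math


def dft(x, inverse=False):
    """Naive O(n^2) discrete Fourier transform (illustration only)."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t in range(n):
            acc += x[t] * cmath.exp(sign * 2j * math.pi * k * t / n)
        out.append(acc / n if inverse else acc)
    return out


def analytic_signal(x):
    """Analytic signal of a real sequence: keep DC (and Nyquist),
    double the positive frequencies, zero the negative ones."""
    n = len(x)
    weights = [0.0] * n
    weights[0] = 1.0
    if n % 2 == 0:
        weights[n // 2] = 1.0
        last_positive = n // 2
    else:
        last_positive = (n + 1) // 2
    for k in range(1, last_positive):
        weights[k] = 2.0
    spectrum = dft(x)
    return dft([s * w for s, w in zip(spectrum, weights)], inverse=True)


def envelope_and_tfs(x):
    """Split a (single-band) signal into envelope and fine structure."""
    z = analytic_signal(x)
    envelope = [abs(v) for v in z]                # slow amplitude contour
    tfs = [math.cos(cmath.phase(v)) for v in z]   # unit-amplitude carrier
    return envelope, tfs


# Toy input: a carrier at bin 16 whose amplitude is modulated at bin 2.
n = 64
signal = [
    (1 + 0.5 * math.cos(2 * math.pi * 2 * t / n))
    * math.cos(2 * math.pi * 16 * t / n)
    for t in range(n)
]
envelope, tfs = envelope_and_tfs(signal)
```

Multiplying `envelope` by `tfs` sample by sample gives back the input, which is why degrading one of the two while keeping the other is a way to probe their separate contributions. In practice this decomposition is usually applied per band after a filterbank rather than to the full-band signal.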
