Spectral distribution of prosodic information.

Authors

Grant K W, Walden B E

Affiliation

Walter Reed Army Medical Center, Washington, DC, USA.

Publication

J Speech Hear Res. 1996 Apr;39(2):228-38. doi: 10.1044/jshr.3902.228.

DOI:10.1044/jshr.3902.228
PMID:8729913
Abstract

Prosodic speech cues for rhythm, stress, and intonation are related primarily to variations in intensity, duration, and fundamental frequency. Because these cues make use of temporal properties of the speech waveform, they are likely to be represented broadly across the speech spectrum. In order to determine the relative importance of different frequency regions for the recognition of prosodic cues, identification of four prosodic features (syllable number, syllabic stress, sentence intonation, and phrase boundary location) was evaluated under six filter conditions spanning the range from 200-6100 Hz. Each filter condition had equal articulation index (AI) weights, AI = 0.01; p(C) for isolated words approximately equal to 0.40. Results obtained with normally hearing subjects showed that there was an interaction between filter condition and the identification of specific prosodic features. For example, information from high-frequency regions of speech was particularly useful in the identification of syllable number and stress, whereas information from low-frequency regions was helpful in identifying intonation patterns. In spite of these spectral differences, overall listeners performed remarkably well in identifying prosodic patterns, although individual differences were apparent. For some subjects, equivalent levels of performance across the six filter conditions were achieved. These results are discussed in relation to auditory and auditory-visual speech recognition.

Similar articles

1
Spectral distribution of prosodic information.
J Speech Hear Res. 1996 Apr;39(2):228-38. doi: 10.1044/jshr.3902.228.
2
The use of phrase-level prosodic information in lexical segmentation: evidence from word-spotting experiments in Korean.
J Acoust Soc Am. 2009 May;125(5):3373-86. doi: 10.1121/1.3097777.
3
Integration efficiency for speech perception within and across sensory modalities by normal-hearing and hearing-impaired individuals.
J Acoust Soc Am. 2007 Feb;121(2):1164-76. doi: 10.1121/1.2405859.
4
Multimodal and Spectral Degradation Effects on Speech and Emotion Recognition in Adult Listeners.
Trends Hear. 2018 Jan-Dec;22:2331216518804966. doi: 10.1177/2331216518804966.
5
Word recognition for temporally and spectrally distorted materials: the effects of age and hearing loss.
Ear Hear. 2012 May-Jun;33(3):349-66. doi: 10.1097/AUD.0b013e318242571c.
6
Prosodic structure shapes the temporal realization of intonation and manual gesture movements.
J Speech Lang Hear Res. 2013 Jun;56(3):850-64. doi: 10.1044/1092-4388(2012/12-0049). Epub 2012 Dec 28.
7
The effect of spectral smearing on the identification of pure F0 intonation contours in vocoder simulations of cochlear implants.
Cochlear Implants Int. 2015 Mar;16(2):77-87. doi: 10.1179/1754762814Y.0000000086. Epub 2014 Jul 7.
8
Production and perception of speech intonation in pediatric cochlear implant recipients and individuals with normal hearing.
Ear Hear. 2008 Jun;29(3):336-51. doi: 10.1097/AUD.0b013e318168d94d.
9
Assessment of Spectral and Temporal Resolution in Cochlear Implant Users Using Psychoacoustic Discrimination and Speech Cue Categorization.
Ear Hear. 2016 Nov/Dec;37(6):e377-e390. doi: 10.1097/AUD.0000000000000328.
10
The perception of sentence stress in cochlear implant recipients.
Ear Hear. 2011 Jul-Aug;32(4):459-67. doi: 10.1097/AUD.0b013e3182064882.

Cited by

1
The combination of accent method and phonemic contrast: an innovative strategy to improve speech production on post-stroke dysarthria.
Front Hum Neurosci. 2024 Jan 8;17:1298974. doi: 10.3389/fnhum.2023.1298974. eCollection 2023.
2
Band importance for speech-in-speech recognition.
JASA Express Lett. 2021 Aug;1(8):084402. doi: 10.1121/10.0005762. Epub 2021 Aug 2.
3
Band importance for sentences and words reexamined.
J Acoust Soc Am. 2013 Jan;133(1):463-73. doi: 10.1121/1.4770246.
4
The effect of lip-reading on primary stream segregation.
J Acoust Soc Am. 2011 Jul;130(1):283-91. doi: 10.1121/1.3592223.
5
Perceptual weighting of individual and concurrent cues for sentence intelligibility: frequency, envelope, and fine structure.
J Acoust Soc Am. 2011 Feb;129(2):977-88. doi: 10.1121/1.3531954.
6
Consistency of sentence intelligibility across difficult listening situations.
J Speech Lang Hear Res. 2006 Aug;49(4):823-34. doi: 10.1044/1092-4388(2006/058).
7
Prosodic processing by children: an fMRI study.
Brain Lang. 2006 Jun;97(3):332-42. doi: 10.1016/j.bandl.2005.12.004. Epub 2006 Feb 7.
8
The influence of the lexicon on speech read word recognition: contrasting segmental and lexical distinctiveness.
Psychon Bull Rev. 2002 Jun;9(2):341-7. doi: 10.3758/bf03196291.