Suppr超能文献

基频轮廓处理对背景噪声中语音可懂度的影响。

The effects of fundamental frequency contour manipulations on speech intelligibility in background noise.

机构信息

Department of Speech-Language-Hearing Sciences, University of Minnesota, Minneapolis, Minnesota 55455, USA.

出版信息

J Acoust Soc Am. 2010 Jul;128(1):435-43. doi: 10.1121/1.3397384.

Abstract

Previous studies have documented that speech with flattened or inverted fundamental frequency (F0) contours is less intelligible than speech with natural variations in F0. The purpose of this present study was to further investigate how F0 manipulations affect speech intelligibility in background noise. Speech recognition in noise was measured for sentences having the following F0 contours: unmodified, flattened at the median, natural but exaggerated, inverted, and sinusoidally frequency modulated at rates of 2.5 and 5.0 Hz, rates shown to make vowels more perceptually salient in background noise. Five talkers produced 180 stimulus sentences, with 30 unique sentences per F0 contour condition. Flattening or exaggerating the F0 contour reduced key word recognition performance by 13% relative to the naturally produced speech. Inverting or sinusoidally frequency modulating the F0 contour reduced performance by 23% relative to typically produced speech. These results support the notion that linguistically incorrect or misleading cues have a greater deleterious effect on speech understanding than linguistically neutral cues.

摘要

先前的研究已经证明,基频(F0)轮廓平坦或倒置的语音比 F0 自然变化的语音更难以理解。本研究的目的是进一步探讨 F0 操纵如何影响背景噪声中的语音可懂度。对具有以下 F0 轮廓的句子进行噪声中的语音识别:未修改、中间部分平坦、自然但夸张、倒置和以 2.5 和 5.0 Hz 的频率正弦调频,这些频率被证明可以使元音在背景噪声中更明显。五位说话者产生了 180 个刺激句子,每个 F0 轮廓条件有 30 个独特的句子。与自然产生的语音相比,F0 轮廓的平坦化或夸张化将关键字词识别性能降低了 13%。与通常产生的语音相比,F0 轮廓的倒置或正弦调频将性能降低了 23%。这些结果支持了这样一种观点,即语言上不正确或误导性的线索对语音理解的负面影响大于语言上中性的线索。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验