The effects of fundamental frequency contour manipulations on speech intelligibility in background noise.

Affiliation

Department of Speech-Language-Hearing Sciences, University of Minnesota, Minneapolis, Minnesota 55455, USA.

Publication information

J Acoust Soc Am. 2010 Jul;128(1):435-43. doi: 10.1121/1.3397384.

Abstract

Previous studies have documented that speech with flattened or inverted fundamental frequency (F0) contours is less intelligible than speech with natural variations in F0. The purpose of the present study was to further investigate how F0 manipulations affect speech intelligibility in background noise. Speech recognition in noise was measured for sentences having the following F0 contours: unmodified, flattened at the median, natural but exaggerated, inverted, and sinusoidally frequency modulated at rates of 2.5 and 5.0 Hz, rates shown to make vowels more perceptually salient in background noise. Five talkers produced 180 stimulus sentences, with 30 unique sentences per F0 contour condition. Flattening or exaggerating the F0 contour reduced key word recognition performance by 13% relative to the naturally produced speech. Inverting or sinusoidally frequency modulating the F0 contour reduced performance by 23% relative to typically produced speech. These results support the notion that linguistically incorrect or misleading cues have a greater deleterious effect on speech understanding than linguistically neutral cues.
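The contour conditions named in the abstract can be illustrated with a minimal sketch. The abstract does not say how the manipulations were computed, so everything below is an assumption made for illustration: the contour is a list of voiced-frame F0 values in Hz, excursions are scaled around the median on a log-frequency axis, and the function names, the exaggeration factor of 1.75, and the modulation depth of 2 semitones are hypothetical rather than taken from the study. Only the median as the flattening reference and the 2.5 and 5.0 Hz modulation rates come from the abstract.

```python
import math
from statistics import median


def scale_contour(f0, k):
    """Scale F0 excursions around the median by k on a log-frequency axis.

    k = 1 leaves the contour unchanged, k = 0 flattens it at the median,
    k > 1 exaggerates it, and k = -1 mirrors (inverts) it about the median.
    Working in log frequency is an assumption, not the study's stated method.
    """
    m = median(f0)
    return [m * (f / m) ** k for f in f0]


def flatten(f0):
    """Replace every value with the median F0."""
    return scale_contour(f0, 0)


def exaggerate(f0, factor=1.75):
    """Expand excursions around the median (factor is a hypothetical value)."""
    return scale_contour(f0, factor)


def invert(f0):
    """Turn rises into falls and falls into rises around the median."""
    return scale_contour(f0, -1)


def sinusoidal_fm(f0, times, rate_hz, depth_semitones=2.0):
    """Replace the contour with a sinusoid centred on the median F0.

    times holds the frame times in seconds; rate_hz would be 2.5 or 5.0
    for the conditions in the abstract. The depth is a hypothetical value.
    """
    m = median(f0)
    return [
        m * 2 ** (depth_semitones / 12 * math.sin(2 * math.pi * rate_hz * t))
        for t in times
    ]


if __name__ == "__main__":
    contour = [180.0, 210.0, 240.0, 200.0, 160.0]  # made-up F0 values in Hz
    frame_times = [0.00, 0.05, 0.10, 0.15, 0.20]   # made-up frame times in s
    print("flattened:  ", flatten(contour))
    print("exaggerated:", exaggerate(contour))
    print("inverted:   ", invert(contour))
    print("2.5 Hz FM:  ", sinusoidal_fm(contour, frame_times, 2.5))
```

Expressing flattening, exaggeration, and inversion as one scaling factor makes the contrast in the abstract easy to see: flattening removes the F0 movement, exaggeration keeps its direction, and inversion reverses it. Resynthesizing speech with the modified contour is a separate step that this sketch does not cover.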
