Suppr超能文献

背景噪声的离频和频域成分对未处理语音和语音编码掩蔽的相对贡献。

Relative contribution of off- and on-frequency spectral components of background noise to the masking of unprocessed and vocoded speech.

机构信息

Department of Speech and Hearing Science, The Ohio State University, Columbus, Ohio 43210, USA.

出版信息

J Acoust Soc Am. 2010 Oct;128(4):2075-84. doi: 10.1121/1.3478845.

Abstract

The present study examined the relative influence of the off- and on-frequency spectral components of modulated and unmodulated maskers on consonant recognition. Stimuli were divided into 30 contiguous equivalent rectangular bandwidths. The temporal fine structure (TFS) in each "target" band was either left intact or replaced with tones using vocoder processing. Recognition scores for 10, 15 and 20 target bands randomly located in frequency were obtained in quiet and in the presence of all 30 masker bands, only the off-frequency masker bands, or only the on-frequency masker bands. The amount of masking produced by the on-frequency bands was generally comparable to that produced by the broadband masker. However, the difference between these two conditions was often significant, indicating an influence of the off-frequency masker bands, likely through modulation interference or spectral restoration. Although vocoder processing systematically lead to poorer consonant recognition scores, the deficit observed in noise could often be attributed to that observed in quiet. These data indicate that (i) speech recognition is affected by the off-frequency components of the background and (ii) the nature of the target TFS does not systematically affect speech recognition in noise, especially when energetic masking and/or the number of target bands is limited.

摘要

本研究考察了调制和未调制掩蔽声的离频和频域谱分量对辅音识别的相对影响。刺激被分为 30 个连续的等效矩形带宽。每个“目标”频带中的时间精细结构(TFS)要么保持完整,要么使用声码器处理用音调替换。在安静环境中和在存在所有 30 个掩蔽带、仅离频掩蔽带或仅频掩蔽带的情况下,随机位于频率中的 10、15 和 20 个目标频带的识别分数都得到了获取。离频带产生的掩蔽量通常与宽带掩蔽声产生的掩蔽量相当。然而,这两种情况之间的差异通常很显著,表明离频掩蔽带的影响,可能是通过调制干扰或频谱恢复。尽管声码器处理会导致辅音识别分数系统性地下降,但在噪声中观察到的缺陷通常可以归因于在安静环境中观察到的缺陷。这些数据表明:(i)语音识别受到背景的离频分量的影响;(ii)目标 TFS 的性质不会系统性地影响噪声中的语音识别,尤其是在能量掩蔽和/或目标频带数量有限的情况下。

相似文献

2
Indications for temporal fine structure contribution to co-modulation masking release.
J Acoust Soc Am. 2010 Dec;128(6):3614-24. doi: 10.1121/1.3500673.
3
Estimates of basilar-membrane nonlinearity effects on masking of tones and speech.
Ear Hear. 2007 Feb;28(1):2-17. doi: 10.1097/AUD.0b013e3180310212.
5
Effect of masker modulation depth on speech masking release.
Hear Res. 2008 May;239(1-2):60-8. doi: 10.1016/j.heares.2008.01.012. Epub 2008 Feb 2.
6
Combination of binaural and harmonic masking release effects in the detection of a single component in complex tones.
Hear Res. 2018 Mar;359:23-31. doi: 10.1016/j.heares.2017.12.007. Epub 2017 Dec 14.
8
Spatial and temporal disparity in signals and maskers affects signal detection in non-human primates.
Hear Res. 2017 Feb;344:1-12. doi: 10.1016/j.heares.2016.10.013. Epub 2016 Oct 19.
9
Effects of spectral shifting on speech perception in noise.
Hear Res. 2010 Dec 1;270(1-2):81-8. doi: 10.1016/j.heares.2010.09.005. Epub 2010 Sep 22.
10
Simulations of cochlear-implant speech perception in modulated and unmodulated noise.
J Acoust Soc Am. 2010 Aug;128(2):870-80. doi: 10.1121/1.3458817.

引用本文的文献

1
Speech Perception with Spectrally Non-overlapping Maskers as Measure of Spectral Resolution in Cochlear Implant Users.
J Assoc Res Otolaryngol. 2019 Apr;20(2):151-167. doi: 10.1007/s10162-018-00702-2. Epub 2018 Nov 19.
2
Adaptation to Noise in Human Speech Recognition Unrelated to the Medial Olivocochlear Reflex.
J Neurosci. 2018 Apr 25;38(17):4138-4145. doi: 10.1523/JNEUROSCI.0024-18.2018. Epub 2018 Mar 28.
3
Speech recognition for multiple bands: Implications for the Speech Intelligibility Index.
J Acoust Soc Am. 2016 Sep;140(3):2019. doi: 10.1121/1.4962539.
4
6
An algorithm to improve speech recognition in noise for hearing-impaired listeners.
J Acoust Soc Am. 2013 Oct;134(4):3029-38. doi: 10.1121/1.4820893.
8
Spatial release from masking as a function of the spectral overlap of competing talkers.
J Acoust Soc Am. 2013 Jun;133(6):3677-80. doi: 10.1121/1.4803517.
9
Band importance for sentences and words reexamined.
J Acoust Soc Am. 2013 Jan;133(1):463-73. doi: 10.1121/1.4770246.

本文引用的文献

1
On the number of auditory filter outputs needed to understand speech: further evidence for auditory channel independence.
Hear Res. 2009 Sep;255(1-2):99-108. doi: 10.1016/j.heares.2009.06.005. Epub 2009 Jun 16.
2
Effects of spectral smearing and temporal fine structure degradation on speech masking release.
J Acoust Soc Am. 2009 Jun;125(6):4023-33. doi: 10.1121/1.3126344.
6
Selectivity of modulation interference for consonant identification in normal-hearing listeners.
J Acoust Soc Am. 2008 Mar;123(3):1665-72. doi: 10.1121/1.2828067.
7
A detailed study on the effects of noise on speech intelligibility.
J Acoust Soc Am. 2007 Nov;122(5):2865-71. doi: 10.1121/1.2783131.
8
Performance of patients using different cochlear implant systems: effects of input dynamic range.
Ear Hear. 2007 Apr;28(2):260-75. doi: 10.1097/AUD.0b013e3180312607.
9
Speech perception problems of the hearing impaired reflect inability to use temporal fine structure.
Proc Natl Acad Sci U S A. 2006 Dec 5;103(49):18866-9. doi: 10.1073/pnas.0607364103. Epub 2006 Nov 20.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验