Marrufo-Pérez Miriam I, Lopez-Poveda Enrique A
Instituto de Neurociencias de Castilla y León (INCYL), Universidad de Salamanca, Salamanca, Spain.
Instituto de Investigación Biomédica de Salamanca (IBSAL), Universidad de Salamanca, Salamanca, Spain.
Trends Hear. 2025 Jan-Dec;29:23312165251343457. doi: 10.1177/23312165251343457. Epub 2025 May 15.
The recognition of isolated words in noise improves as words are delayed from the noise onset. This phenomenon, known as adaptation to noise, has been mostly investigated using synthetic noises. The aim here was to investigate whether adaptation occurs for realistic noises and to what extent it depends on the spectrum and level fluctuations of the noise. Forty-nine different realistic and synthetic noises were analyzed and classified according to how much they fluctuated in level over time and how much their spectra differed from the speech spectrum. Six representative noises were chosen that covered the observed range of level fluctuations and spectral differences but could still mask speech. For the six noises, speech reception thresholds (SRTs) were measured for natural and tone-vocoded words delayed 50 (early condition) and 800 ms (late condition) from the noise onset. Adaptation was calculated as the SRT improvement in the late relative to the early condition. Twenty-two adults with normal hearing participated in the experiments. For natural words, adaptation was small overall (mean = 0.5 dB) and similar across the six noises. For vocoded words, significant adaptation occurred for all six noises (mean = 1.3 dB) and was not statistically different across noises. For the tested noises, the amount of adaptation was independent of the spectrum and level fluctuations of the noise. The results suggest that adaptation in speech recognition can occur in realistic noisy environments.
随着单词相对于噪声起始时间的延迟,在噪声中对孤立单词的识别能力会提高。这种现象,即所谓的噪声适应,大多是使用合成噪声进行研究的。本文的目的是研究在现实噪声环境中是否会发生适应现象,以及适应程度在多大程度上取决于噪声的频谱和电平波动。分析了49种不同的现实噪声和合成噪声,并根据它们随时间的电平波动程度以及它们的频谱与语音频谱的差异程度进行分类。选择了六种具有代表性的噪声,它们涵盖了观察到的电平波动范围和频谱差异,但仍能掩蔽语音。对于这六种噪声,测量了自然词和声调编码词在相对于噪声起始延迟50毫秒(早期条件)和800毫秒(晚期条件)时的言语接受阈值(SRT)。适应程度通过晚期条件相对于早期条件下SRT的改善来计算。22名听力正常的成年人参与了实验。对于自然词,总体适应程度较小(平均值 = 0.5分贝),且在六种噪声中相似。对于编码词,所有六种噪声都出现了显著的适应(平均值 = 1.3分贝),且不同噪声之间在统计学上没有差异。对于测试的噪声,适应程度与噪声的频谱和电平波动无关。结果表明,在现实嘈杂环境中,语音识别的适应现象是可以发生的。