Suppr超能文献

建模具有时变耳间相位差掩蔽器的双耳掩蔽语音中的迟钝现象。

Modeling Sluggishness in Binaural Unmasking of Speech for Maskers With Time-Varying Interaural Phase Differences.

机构信息

1 Medizinische Physik, Carl von Ossietzky Universität, Oldenburg, Germany.

2 Cluster of Excellence Hearing4All, Carl von Ossietzky Universität, Oldenburg, Germany.

出版信息

Trends Hear. 2018 Jan-Dec;22:2331216517753547. doi: 10.1177/2331216517753547.

Abstract

In studies investigating binaural processing in human listeners, relatively long and task-dependent time constants of a binaural window ranging from 10 ms to 250 ms have been observed. Such time constants are often thought to reflect "binaural sluggishness." In this study, the effect of binaural sluggishness on binaural unmasking of speech in stationary speech-shaped noise is investigated in 10 listeners with normal hearing. In order to design a masking signal with temporally varying binaural cues, the interaural phase difference of the noise was modulated sinusoidally with frequencies ranging from 0.25 Hz to 64 Hz. The lowest, that is the best, speech reception thresholds (SRTs) were observed for the lowest modulation frequency. SRTs increased with increasing modulation frequency up to 4 Hz. For higher modulation frequencies, SRTs remained constant in the range of 1 dB to 1.5 dB below the SRT determined in the diotic situation. The outcome of the experiment was simulated using a short-term binaural speech intelligibility model, which combines an equalization-cancellation (EC) model with the speech intelligibility index. This model segments the incoming signal into 23.2-ms time frames in order to predict release from masking in modulated noises. In order to predict the results from this study, the model required a further time constant applied to the EC mechanism representing binaural sluggishness. The best agreement with perceptual data was achieved using a temporal window of 200 ms in the EC mechanism.

摘要

在研究人类听众的双耳处理时,观察到的双耳窗口的时间常数相对较长且依赖于任务,范围从 10ms 到 250ms。这种时间常数通常被认为反映了“双耳迟钝”。在这项研究中,10 名听力正常的听众研究了双耳迟钝对稳态语音噪声中语音掩蔽的影响。为了设计具有时变双耳线索的掩蔽信号,噪声的耳间相位差以 0.25Hz 至 64Hz 的频率进行正弦调制。最低调制频率下观察到了最佳的语音接收阈值 (SRT)。随着调制频率的增加,SRT 增加到 4Hz 。对于更高的调制频率,SRT 在 1 分贝至 1.5 分贝的范围内保持恒定,低于在同态情况下确定的 SRT。使用短期双耳语音可懂度模型模拟实验结果,该模型将均衡-消除 (EC) 模型与语音可懂度指数相结合。该模型将输入信号分成 23.2ms 的时间帧,以预测调制噪声中的掩蔽释放。为了预测本研究的结果,模型需要进一步的时间常数应用于表示双耳迟钝的 EC 机制。在 EC 机制中使用 200ms 的时间窗口可以实现与感知数据的最佳一致性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c09/5774735/1f929b306990/10.1177_2331216517753547-fig1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验