The dynamic range of useful temporal fine structure cues for speech in the presence of a competing talker.

Affiliation

Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom.

Publication information

J Acoust Soc Am. 2011 Oct;130(4):2162-72. doi: 10.1121/1.3625237.

Abstract

Within an auditory channel, the speech waveform contains both temporal envelope (E(O)) and temporal fine structure (TFS) information. Vocoder processing extracts a modified version of the temporal envelope (E') within each channel and uses it to modulate a channel carrier. The resulting signal, E'(Carr), has reduced information content compared to the original "E(O) + TFS" signal. The dynamic range over which listeners make additional use of E(O) + TFS over E'(Carr) cues was investigated in a competing-speech task. The target-and-background mixture was processed using a 30-channel vocoder. In each channel, E(O) + TFS replaced E'(Carr) at either the peaks or the valleys of the signal. The replacement decision was based on comparing the short-term channel level to a parametrically varied "switching threshold," expressed relative to the long-term channel level. Intelligibility was measured as a function of switching threshold, carrier type, target-to-background ratio, and replacement method. Scores showed a dependence on all four parameters. Derived intensity-importance functions (IIFs) showed that E(O) + TFS information from 8-13 dB below to 10 dB above the channel long-term level was important. When E(O) + TFS information was added at the peaks, IIFs peaked around -2 dB, but when E(O) + TFS information was added at the valleys, the peaks lay around +1 dB.
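
The per-channel replacement rule described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not specify several details, so the following are assumptions made for illustration only: non-overlapping rectangular frames, a hard switch with no cross-fade, levels computed as RMS in dB, and both the short-term and long-term levels measured on the unprocessed channel signal. The names `rms_db`, `switch_channel`, `threshold_db`, `frame_len`, and `replace_at` are hypothetical.

```python
import math


def rms_db(samples):
    """RMS level of a block of samples in dB re 1.0, floored to avoid log(0)."""
    if not samples:
        raise ValueError("samples must not be empty")
    power = sum(x * x for x in samples) / len(samples)
    return 10.0 * math.log10(max(power, 1e-12))


def switch_channel(original, vocoded, threshold_db, frame_len, replace_at="peaks"):
    """Mix one channel's original (E(O) + TFS) and vocoded (E'(Carr)) signals.

    Assumed rule: a frame counts as a "peak" when its short-term level,
    relative to the long-term channel level, is at or above threshold_db,
    and as a "valley" otherwise. The original signal replaces the vocoded
    signal only in frames of the kind named by replace_at.
    """
    if len(original) != len(vocoded):
        raise ValueError("original and vocoded must have the same length")
    if replace_at not in ("peaks", "valleys"):
        raise ValueError("replace_at must be 'peaks' or 'valleys'")

    long_term_db = rms_db(original)  # assumed reference: whole-channel RMS
    out = list(vocoded)              # start from the fully vocoded channel
    for start in range(0, len(original), frame_len):
        frame = original[start:start + frame_len]
        relative_db = rms_db(frame) - long_term_db
        is_peak = relative_db >= threshold_db
        if is_peak == (replace_at == "peaks"):
            out[start:start + frame_len] = frame
    return out


# Toy channel: one loud frame followed by one quiet frame.
original = [1.0] * 4 + [0.01] * 4
vocoded = [0.5] * 8
at_peaks = switch_channel(original, vocoded, 0.0, 4, "peaks")
at_valleys = switch_channel(original, vocoded, 0.0, 4, "valleys")
```

In this toy case the loud frame sits about 3 dB above the long-term level and the quiet frame about 37 dB below it, so a 0 dB threshold restores the original signal in the first frame under peak replacement and in the second frame under valley replacement. Sweeping `threshold_db` corresponds to the parametric variation of the switching threshold described above.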
