Fu Q J, Galvin J J, Wang X
Department of Auditory Implants and Perception, House Ear Institute, Los Angeles, California 90057, USA.
J Acoust Soc Am. 2001 Jan;109(1):379-84. doi: 10.1121/1.1327578.
This study evaluated the effects of time compression and expansion on sentence recognition by normal-hearing (NH) listeners and cochlear-implant (CI) recipients of the Nucleus-22 device. Sentence recognition was measured in five CI users using custom 4-channel continuous interleaved sampler (CIS) processors and in five NH listeners using either 4-channel or 32-channel noise-band processors. For NH listeners, recognition was largely unaffected by time expansion, regardless of spectral resolution. Recognition of time-compressed speech, however, varied significantly with spectral resolution. When fine spectral resolution (32 channels) was available, speech recognition was unaffected even when sentences were shortened to 40% of their original duration (equivalent to a mean duration of 40 ms/phoneme). When only coarse spectral resolution (4 channels) was available, a mean duration of 60 ms/phoneme was required to achieve the same level of recognition. Recognition patterns were highly variable across CI listeners. The best CI listener performed as well as NH subjects listening under the corresponding spectral conditions; however, three of the five CI listeners performed significantly worse in recognizing time-compressed speech. Further investigation revealed that these three poorer-performing CI users also had more difficulty with simple temporal gap-detection tasks. The results indicate that limited spectral resolution reduces the ability to recognize time-compressed speech. Some CI listeners have more difficulty with time-compressed speech, such as that produced by rapid speakers, because of reduced spectral resolution and deficits in auditory temporal processing.
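The relation between duration ratio and mean phoneme duration quoted above can be checked with a short sketch. It assumes a baseline of about 100 ms/phoneme for the uncompressed sentences; that figure is not stated in the abstract and is inferred only from the stated equivalence of 40% duration and 40 ms/phoneme. The function names are hypothetical and do not come from the study.

```python
# Assumed baseline, inferred from "40% of original duration = 40 ms/phoneme".
# The abstract does not report this value directly.
BASELINE_MS_PER_PHONEME = 100.0


def mean_phoneme_duration_ms(duration_ratio, baseline_ms=BASELINE_MS_PER_PHONEME):
    """Mean phoneme duration after scaling sentence duration by duration_ratio.

    duration_ratio < 1 is time compression, > 1 is time expansion.
    """
    if duration_ratio <= 0:
        raise ValueError("duration_ratio must be positive")
    return duration_ratio * baseline_ms


def duration_ratio_for(target_ms, baseline_ms=BASELINE_MS_PER_PHONEME):
    """Duration ratio that yields a target mean phoneme duration."""
    if target_ms <= 0 or baseline_ms <= 0:
        raise ValueError("durations must be positive")
    return target_ms / baseline_ms


if __name__ == "__main__":
    # 32-channel condition: sentences shortened to 40% of original duration.
    print(mean_phoneme_duration_ms(0.40))
    # 4-channel condition: ratio implied by the 60 ms/phoneme requirement.
    print(duration_ratio_for(60.0))
```

Under this assumed baseline, the 60 ms/phoneme needed with 4 channels corresponds to sentences at roughly 60% of their original duration, compared with 40% when 32 channels are available.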