Suppr超能文献

噪声激励的连续交错采样人工耳蜗植入物声码器模拟中音调的频谱和时间线索。

Spectral and temporal cues to pitch in noise-excited vocoder simulations of continuous-interleaved-sampling cochlear implants.

作者信息

Green Tim, Faulkner Andrew, Rosen Stuart

机构信息

Department of Phonetics and Linguistics, University College London, United Kingdom.

出版信息

J Acoust Soc Am. 2002 Nov;112(5 Pt 1):2155-64. doi: 10.1121/1.1506688.

Abstract

Four-band and single-band noise-excited vocoders were used in acoustic simulations to investigate spectral and temporal cues to melodic pitch in the output of a cochlear implant speech processor. Noise carriers were modulated by amplitude envelopes extracted by half-wave rectification and low-pass filtering at 32 or 400 Hz. The four-band, but not the single-band processors, may preserve spectral correlates of fundamental frequency (F0). Envelope smoothing at 400 Hz preserves temporal correlates of F0, which are eliminated with 32-Hz smoothing. Inputs to the processors were sawtooth frequency glides, in which spectral variation is completely determined by F0, or synthetic diphthongal vowel glides, whose spectral shape is dominated by varying formant resonances. Normal listeners labeled the direction of pitch movement of the processed stimuli. For processed sawtooth waves, purely temporal cues led to decreasing performance with increasing F0. With purely spectral cues, performance was above chance despite the limited spectral resolution of the processors. For processed diphthongs, performance with purely spectral cues was at chance, showing that spectral envelope changes due to formant movement obscured spectral cues to F0. Performance with temporal cues was poorer for diphthongs than for sawtooths, with very limited discrimination at higher F0. These data suggest that, for speech signals through a typical cochlear implant processor, spectral cues to pitch are likely to have limited utility, while temporal envelope cues may be useful only at low F0.

摘要

在声学模拟中使用了四频段和单频段噪声激励声码器,以研究人工耳蜗语音处理器输出中旋律音高的频谱和时间线索。噪声载波由通过半波整流和32或400Hz低通滤波提取的幅度包络调制。四频段处理器(而非单频段处理器)可能会保留基频(F0)的频谱相关性。400Hz的包络平滑保留了F0的时间相关性,而32Hz平滑会消除这些相关性。处理器的输入是锯齿波频率滑动,其中频谱变化完全由F0决定,或者是合成双元音滑动,其频谱形状由变化的共振峰共振主导。正常听众对处理后的刺激的音高移动方向进行了标注。对于处理后的锯齿波,纯粹的时间线索导致随着F0增加性能下降。对于纯粹的频谱线索,尽管处理器的频谱分辨率有限,但性能仍高于随机水平。对于处理后的双元音,纯粹频谱线索的性能处于随机水平,表明由于共振峰移动导致的频谱包络变化掩盖了F0的频谱线索。双元音的时间线索性能比锯齿波差,在较高F0时辨别能力非常有限。这些数据表明,对于通过典型人工耳蜗处理器的语音信号,音高的频谱线索可能效用有限,而时间包络线索可能仅在低F0时有用。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验