Enhancing temporal cues to voice pitch in continuous interleaved sampling cochlear implants.

Authors

Green Tim, Faulkner Andrew, Rosen Stuart

Affiliation

Department of Phonetics and Linguistics, University College London, Wolfson House, London NW1 2HE, United Kingdom.

Publication information

J Acoust Soc Am. 2004 Oct;116(4 Pt 1):2298-310. doi: 10.1121/1.1785611.

Abstract

The limited spectral resolution of cochlear implant systems means that voice pitch perception depends on weak temporal envelope cues. Enhancement of such cues was investigated in implant users and in acoustic simulations. Subjects labeled the pitch movement of processed synthetic diphthongal glides. In standard processing, noise carriers (simulations) or pulse trains (implant users) were modulated by 400 Hz low-pass envelopes. In modified processing, carriers were modulated by two components: (1) slow-rate (<32 Hz) envelope modulations, conveying dynamic spectral shape changes crucial for speech; (2) a simplified waveform (e.g., a sawtooth) matching the periodicity of the input diphthong. In both normal listeners and implant users, performance was better with modified processing, though temporal envelope cues were less effective at higher F0. Factors contributing to the advantage for modified processing may include increased modulation depth and the use of a modulation waveform featuring a rapid onset in each period, resulting in a clearer representation of F0 in the neural firing pattern. Eliminating slow-rate spectral dynamics, so that within-channel amplitude changes solely reflected F0, showed that dynamic spectral variation obscured temporal pitch cues. Though significant, the advantages for modified processing were small, suggesting that the potential for developing strategies delivering enhanced pitch perception is limited.
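The two-component modulation described in the abstract can be illustrated for a single channel of the acoustic simulation. The following is a minimal sketch under stated assumptions, not the authors' implementation: the function names, the one-pole low-pass filter, the full-wave rectification, the uniform noise carrier, and the constant F0 are all illustrative choices that the abstract does not specify (the study itself used diphthongal glides, where F0 changes over time).

```python
import math
import random


def sawtooth_modulator(f0_hz, n_samples, sample_rate_hz):
    """Sawtooth with a rapid onset in each period: it jumps to 1.0 at the
    start of every F0 cycle and then decays linearly toward 0.
    Assumption: F0 is constant over the whole signal."""
    out = []
    for i in range(n_samples):
        phase = (i * f0_hz / sample_rate_hz) % 1.0  # position within the period
        out.append(1.0 - phase)
    return out


def one_pole_lowpass(signal, cutoff_hz, sample_rate_hz):
    """Simple one-pole low-pass smoother (an assumed stand-in for whatever
    envelope filter the study used)."""
    alpha = 1.0 - math.exp(-2.0 * math.pi * cutoff_hz / sample_rate_hz)
    y = 0.0
    out = []
    for x in signal:
        y += alpha * (x - y)
        out.append(y)
    return out


def modified_channel(band_signal, f0_hz, sample_rate_hz, cutoff_hz=32.0, seed=0):
    """Modulate a noise carrier by (1) the slow-rate envelope of the band
    signal and (2) a sawtooth at the voice F0."""
    rectified = [abs(x) for x in band_signal]  # assumed full-wave rectification
    slow_env = one_pole_lowpass(rectified, cutoff_hz, sample_rate_hz)
    saw = sawtooth_modulator(f0_hz, len(band_signal), sample_rate_hz)
    rng = random.Random(seed)
    carrier = [rng.uniform(-1.0, 1.0) for _ in band_signal]  # noise carrier
    return [e * s * c for e, s, c in zip(slow_env, saw, carrier)]


if __name__ == "__main__":
    sr = 16000.0
    # Hypothetical band-limited input: a 1 kHz tone standing in for one analysis band.
    band = [math.sin(2.0 * math.pi * 1000.0 * i / sr) for i in range(1600)]
    channel = modified_channel(band, f0_hz=100.0, sample_rate_hz=sr)
    print(len(channel))
```

Because the sawtooth returns to its maximum at the start of every period regardless of the input's spectral shape, the within-channel modulation depth at F0 stays high, which is one of the factors the abstract suggests may underlie the advantage for modified processing.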
