Suppr超能文献

频谱归一化对人工耳蜗使用者不同说话者语音识别的影响。

Effect of spectral normalization on different talker speech recognition by cochlear implant users.

作者信息

Liu Chuping, Galvin John, Fu Qian-Jie, Narayanan Shrikanth S

机构信息

Department of Electrical Engineering, University of Southern California, Los Angeles, California 90089, USA.

出版信息

J Acoust Soc Am. 2008 May;123(5):2836-47. doi: 10.1121/1.2897047.

Abstract

In cochlear implants (CIs), different talkers often produce different levels of speech understanding because of the spectrally distorted speech patterns provided by the implant device. A spectral normalization approach was used to transform the spectral characteristics of one talker to those of another talker. In Experiment 1, speech recognition with two talkers was measured in CI users, with and without spectral normalization. Results showed that the spectral normalization algorithm had small but significant effect on performance. In Experiment 2, the effects of spectral normalization were measured in CI users and normal-hearing (NH) subjects; a pitch-stretching technique was used to simulate six talkers with different fundamental frequencies and vocal tract configurations. NH baseline performance was nearly perfect with these pitch-shift transformations. For CI subjects, while there was considerable intersubject variability in performance with the different pitch-shift transformations, spectral normalization significantly improved the intelligibility of these simulated talkers. The results from Experiments 1 and 2 demonstrate that spectral normalization toward more-intelligible talkers significantly improved CI users' speech understanding with less-intelligible talkers. The results suggest that spectral normalization using optimal reference patterns for individual CI patients may compensate for some of the acoustic variability across talkers.

摘要

在人工耳蜗(CI)中,由于植入设备提供的频谱失真语音模式,不同的说话者通常会产生不同程度的语音理解。一种频谱归一化方法被用于将一个说话者的频谱特征转换为另一个说话者的频谱特征。在实验1中,在有和没有频谱归一化的情况下,测量了CI使用者对两个说话者的语音识别。结果表明,频谱归一化算法对性能有微小但显著的影响。在实验2中,在CI使用者和正常听力(NH)受试者中测量了频谱归一化的效果;使用了一种音高拉伸技术来模拟六个具有不同基频和声道配置的说话者。对于这些音高偏移变换,NH的基线性能几乎是完美的。对于CI受试者,虽然在不同的音高偏移变换下性能存在相当大的个体间差异,但频谱归一化显著提高了这些模拟说话者的可懂度。实验1和2的结果表明,朝着更易理解的说话者进行频谱归一化显著提高了CI使用者对较难理解的说话者的语音理解。结果表明,为个体CI患者使用最佳参考模式进行频谱归一化可能会补偿不同说话者之间的一些声学变异性。

相似文献

9
The Perception of Regional Dialects and Foreign Accents by Cochlear Implant Users.人工耳蜗使用者对地区方言和外国口音的感知。
J Speech Lang Hear Res. 2021 Feb 17;64(2):683-690. doi: 10.1044/2020_JSLHR-20-00496. Epub 2021 Jan 25.

本文引用的文献

6
Clear speech perception in acoustic and electric hearing.声学和电听觉中的清晰言语感知。
J Acoust Soc Am. 2004 Oct;116(4 Pt 1):2374-83. doi: 10.1121/1.1787528.
8
Recognition of spectrally degraded and frequency-shifted vowels in acoustic and electric hearing.
J Acoust Soc Am. 1999 Mar;105(3):1889-900. doi: 10.1121/1.426725.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验