Suppr超能文献

使用共振峰估计语音处理器的人工耳蜗植入患者的元音和辅音识别

Vowel and consonant recognition of cochlear implant patients using formant-estimating speech processors.

作者信息

Blamey P J, Dowell R C, Brown A M, Clark G M, Seligman P M

出版信息

J Acoust Soc Am. 1987 Jul;82(1):48-57. doi: 10.1121/1.395436.

Abstract

Vowel and consonant confusion matrices were collected in the hearing alone (H), lipreading alone (L), and hearing plus lipreading (HL) conditions for 28 patients participating in the clinical trial of the multiple-channel cochlear implant. All patients were profound-to-totally deaf and "hearing" refers to the presentation of auditory information via the implant. The average scores were 49% for vowels and 37% for consonants in the H condition and the HL scores were significantly higher than the L scores. Information transmission and multidimensional scaling analyses showed that different speech features were conveyed at different levels in the H and L conditions. In the HL condition, the visual and auditory signals provided independent information sources for each feature. For vowels, the auditory signal was the major source of duration information, while the visual signal was the major source of first and second formant frequency information. The implant provided information about the amplitude envelope of the speech and the estimated frequency of the main spectral peak between 800 and 4000 Hz, which was useful for consonant recognition. A speech processor that coded the estimated frequency and amplitude of an additional peak between 300 and 1000 Hz was shown to increase the vowel and consonant recognition in the H condition by improving the transmission of first formant and voicing information.

摘要

在参与多通道人工耳蜗临床试验的28名患者中,收集了在仅听力(H)、仅唇读(L)以及听力加唇读(HL)条件下的元音和辅音混淆矩阵。所有患者均为重度至极重度失聪,“听力”指通过植入物呈现听觉信息。在H条件下,元音的平均得分是49%,辅音的平均得分是37%,HL条件下的得分显著高于L条件下的得分。信息传递和多维尺度分析表明,在H和L条件下,不同的语音特征在不同层面上被传递。在HL条件下,视觉和听觉信号为每个特征提供了独立的信息源。对于元音,听觉信号是时长信息的主要来源,而视觉信号是第一和第二共振峰频率信息的主要来源。植入物提供了有关语音幅度包络以及800至4000赫兹之间主要频谱峰值估计频率的信息,这对辅音识别很有用。一种对300至1000赫兹之间额外峰值的估计频率和幅度进行编码的语音处理器,通过改善第一共振峰和浊音信息的传递,在H条件下提高了元音和辅音的识别率。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验