• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

频率压缩扩展与偏移对语音识别的综合影响。

Combined effects of frequency compression-expansion and shift on speech recognition.

作者信息

Başkent Deniz, Shannon Robert V

机构信息

Department of Biomedical Engineering, University of Southern California, Los Angeles, USA.

出版信息

Ear Hear. 2007 Jun;28(3):277-89. doi: 10.1097/AUD.0b013e318050d398.

DOI:10.1097/AUD.0b013e318050d398
PMID:17485977
Abstract

OBJECTIVE

To explore combined acute effects of frequency shift and compression-expansion on speech recognition, using noiseband vocoder processing.

DESIGN

Recognition of vowels and consonants, processed with a noiseband vocoder, was measured with five normal-hearing subjects, between the ages of 27 and 35 yr. The speech signal was filtered into 8 or 16 analysis bands and the envelopes were extracted from each band. The carrier noise bands were modulated by the envelopes and resynthesized to produce the processed speech. In the baseline matched condition, the frequency ranges of the corresponding analysis and carrier bands were the same. In the shift only condition, the frequency ranges of the carrier bands were shifted up or down relative to the analysis bands. In the compression and expansion only conditions, the analysis band range was made larger or smaller, respectively, than the carrier band range. By applying the shift to carrier bands and compression or expansion to analysis bands simultaneously, the combined effects of the two spectral distortions on speech recognition were explored.

RESULTS

When the spectral distortions of compression-expansion or shift were applied separately, the performance was reduced from the baseline matched condition. However, when the two spectral degradations were applied simultaneously, a compensatory effect was observed; the reduction in performance was smaller for some combinations compared to the reduction observed for each distortion individually.

CONCLUSIONS

The results of the present study are consistent with previous vocoder studies with normal-hearing subjects that showed a negative effect of spectral mismatch between analysis and carrier bands on speech recognition. The present results further show that matching the frequency ranges of 1 to 2 kHz, which contain important speech information, can be more beneficial for speech recognition than matching the overall frequency ranges, in certain conditions.

摘要

目的

使用噪声带声码器处理,探究频移和压缩-扩展对语音识别的联合急性效应。

设计

对5名年龄在27至35岁之间的听力正常受试者进行测试,测量经噪声带声码器处理后的元音和辅音识别情况。语音信号被过滤到8个或16个分析频段,并从每个频段提取包络。载波频段由包络调制并重新合成以产生处理后的语音。在基线匹配条件下,相应分析频段和载波频段的频率范围相同。在仅频移条件下,载波频段的频率范围相对于分析频段向上或向下移动。在仅压缩和扩展条件下,分析频段范围分别比载波频段范围变大或变小。通过同时对载波频段应用频移以及对分析频段应用压缩或扩展,探究了这两种频谱失真对语音识别的联合效应。

结果

当分别应用压缩-扩展或频移的频谱失真时,与基线匹配条件相比,性能有所下降。然而,当同时应用这两种频谱退化时,观察到了一种补偿效应;与单独观察到的每种失真导致的性能下降相比,某些组合的性能下降较小。

结论

本研究结果与先前对听力正常受试者的声码器研究一致,该研究表明分析频段和载波频段之间的频谱失配对语音识别有负面影响。本研究结果进一步表明,在某些条件下,匹配包含重要语音信息的1至2 kHz频率范围对语音识别可能比匹配整体频率范围更有益。

相似文献

1
Combined effects of frequency compression-expansion and shift on speech recognition.频率压缩扩展与偏移对语音识别的综合影响。
Ear Hear. 2007 Jun;28(3):277-89. doi: 10.1097/AUD.0b013e318050d398.
2
Using genetic algorithms with subjective input from human subjects: implications for fitting hearing aids and cochlear implants.将遗传算法与人类受试者的主观输入相结合:对助听器和人工耳蜗适配的启示。
Ear Hear. 2007 Jun;28(3):370-80. doi: 10.1097/AUD.0b013e318047935e.
3
Effects of vowel context on the recognition of initial and medial consonants by cochlear implant users.元音语境对人工耳蜗使用者识别词首和词中辅音的影响。
Ear Hear. 2006 Dec;27(6):658-77. doi: 10.1097/01.aud.0000240543.31567.54.
4
Spectral and temporal cues for phoneme recognition in noise.噪声中音素识别的频谱和时间线索。
J Acoust Soc Am. 2007 Sep;122(3):1758. doi: 10.1121/1.2767000.
5
Effects of single-channel phonemic compression schemes on the understanding of speech by hearing-impaired listeners.
Audiology. 2001 Jan-Feb;40(1):10-25.
6
Temporal envelope changes of compression and speech rate: combined effects on recognition for older adults.压缩和语速的时间包络变化:对老年人识别能力的综合影响。
J Speech Lang Hear Res. 2007 Oct;50(5):1123-38. doi: 10.1044/1092-4388(2007/078).
7
Speech recognition in noise: estimating effects of compressive nonlinearities in the basilar-membrane response.噪声中的语音识别:估计基底膜反应中压缩非线性的影响。
Ear Hear. 2007 Sep;28(5):682-93. doi: 10.1097/AUD.0b013e31812f7156.
8
Phonological mismatch makes aided speech recognition in noise cognitively taxing.语音不匹配会使佩戴助听设备时在噪声环境中的语音识别产生认知负担。
Ear Hear. 2007 Dec;28(6):879-92. doi: 10.1097/AUD.0b013e3181576c9c.
9
Effects of spectral smearing and temporal fine structure degradation on speech masking release.频谱模糊和时间精细结构退化对语音掩蔽释放的影响。
J Acoust Soc Am. 2009 Jun;125(6):4023-33. doi: 10.1121/1.3126344.
10
Speech intelligibility in cochlear implant simulations: Effects of carrier type, interfering noise, and subject experience.人工耳蜗模拟中的言语可懂度:载波类型、干扰噪声和受试者经验的影响。
J Acoust Soc Am. 2007 Oct;122(4):2376-88. doi: 10.1121/1.2773993.

引用本文的文献

1
Electrocochleography-Based Tonotopic Map: II. Frequency-to-Place Mismatch Impacts Speech-Perception Outcomes in Cochlear Implant Recipients.基于电 Cochleography 的音调地形图:II. 频率-位置不匹配对人工耳蜗植入者的言语感知结果的影响。
Ear Hear. 2024;45(6):1406-1417. doi: 10.1097/AUD.0000000000001528. Epub 2024 Jun 17.
2
Improved tactile speech perception using audio-to-tactile sensory substitution with formant frequency focusing.使用具有共振峰频率聚焦的声触觉感觉替代来提高触觉语音感知。
Sci Rep. 2024 Feb 28;14(1):4889. doi: 10.1038/s41598-024-55429-3.
3
Frequency-to-Place Mismatch Impacts Cochlear Implant Quality of Life, But Not Speech Recognition.
频率-位置不匹配影响人工耳蜗生活质量,但不影响言语识别。
Laryngoscope. 2024 Jun;134(6):2898-2905. doi: 10.1002/lary.31264. Epub 2024 Jan 12.
4
Improving speech perception for hearing-impaired listeners using audio-to-tactile sensory substitution with multiple frequency channels.使用多频通道音频触觉感觉替代提高听力受损听众的语音感知。
Sci Rep. 2023 Aug 16;13(1):13336. doi: 10.1038/s41598-023-40509-7.
5
Comparison of Two Place-Based Mapping Procedures on Masked Sentence Recognition as a Function of Electrode Array Angular Insertion Depth and Presence of Acoustic Low-Frequency Information: A Simulation Study.基于两种位置映射程序的掩蔽句识别的比较,作为电极阵列角插入深度和存在声学低频信息的函数:一项模拟研究。
Audiol Neurootol. 2023;28(6):478-487. doi: 10.1159/000531262. Epub 2023 Jul 21.
6
Influence of Electric Frequency-to-Place Mismatches on the Early Speech Recognition Outcomes for Electric-Acoustic Stimulation Users.电-位失配对电-声刺激使用者早期言语识别结果的影响。
Am J Audiol. 2023 Mar;32(1):251-260. doi: 10.1044/2022_AJA-21-00254. Epub 2023 Feb 17.
7
Effects of tonotopic matching and spatial cues on segregation of competing speech in simulations of bilateral cochlear implants.声匹配和空间线索对双侧人工耳蜗模拟中竞争语音分离的影响。
PLoS One. 2022 Jul 5;17(7):e0270759. doi: 10.1371/journal.pone.0270759. eCollection 2022.
8
Effect of Place-Based Versus Default Mapping Procedures on Masked Speech Recognition: Simulations of Cochlear Implant Alone and Electric-Acoustic Stimulation.基于位置与默认映射程序对掩蔽语音识别的影响:人工耳蜗单独与电声刺激的模拟。
Am J Audiol. 2022 Jun 2;31(2):322-337. doi: 10.1044/2022_AJA-21-00123. Epub 2022 Apr 8.
9
Toddlers' fast-mapping from noise-vocoded speech.婴儿从噪声编码语音的快速映射。
J Acoust Soc Am. 2020 Apr;147(4):2432. doi: 10.1121/10.0001129.
10
Frequency-to-Place Mismatch: Characterizing Variability and the Influence on Speech Perception Outcomes in Cochlear Implant Recipients.频率-位置失配:对人工耳蜗植入者言语感知结果的变异性及其影响进行特征描述。
Ear Hear. 2020 Sep/Oct;41(5):1349-1361. doi: 10.1097/AUD.0000000000000864.