Predictions of Speech Chimaera Intelligibility Using Auditory Nerve Mean-Rate and Spike-Timing Neural Cues.

Authors

Wirtzfeld Michael R, Ibrahim Rasha A, Bruce Ian C

Affiliation

Department of Electrical and Computer Engineering, McMaster University, 1280 Main Street West, Hamilton, L8S 4K1, ON, Canada.

Publication

J Assoc Res Otolaryngol. 2017 Oct;18(5):687-710. doi: 10.1007/s10162-017-0627-7. Epub 2017 Jul 26.

DOI: 10.1007/s10162-017-0627-7
PMID: 28748487
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5612921/
Abstract

Perceptual studies of speech intelligibility have shown that slow variations of acoustic envelope (ENV) in a small set of frequency bands provide adequate information for good perceptual performance in quiet, whereas acoustic temporal fine-structure (TFS) cues play a supporting role in background noise. However, the implications for neural coding are prone to misinterpretation because the mean-rate neural representation can contain recovered ENV cues from cochlear filtering of TFS. We investigated ENV recovery and spike-time TFS coding using objective measures of simulated mean-rate and spike-timing neural representations of chimaeric speech, in which either the ENV or the TFS is replaced by another signal. We (a) evaluated the levels of mean-rate and spike-timing neural information for two categories of chimaeric speech, one retaining ENV cues and the other TFS; (b) examined the level of recovered ENV from cochlear filtering of TFS speech; (c) examined and quantified the contribution to recovered ENV from spike-timing cues using a lateral inhibition network (LIN); and (d) constructed linear regression models with objective measures of mean-rate and spike-timing neural cues and subjective phoneme perception scores from normal-hearing listeners. The mean-rate neural cues from the original ENV and recovered ENV partially accounted for perceptual score variability, with additional variability explained by the recovered ENV from the LIN-processed TFS speech. The best model predictions of chimaeric speech intelligibility were found when both the mean-rate and spike-timing neural cues were included, providing further evidence that spike-time coding of TFS cues is important for intelligibility when the speech envelope is degraded.
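
To make the idea of chimaeric speech concrete, the following is a minimal single-band sketch, not the authors' processing chain. It assumes the ENV is taken as the magnitude of the analytic signal and the TFS as the cosine of its phase; the abstract does not state how the chimaeras were built, and in practice the decomposition would be applied per frequency band. The function names `analytic_signal` and `single_band_chimaera` and the toy input signals are hypothetical, chosen only for illustration.

```python
import numpy as np

def analytic_signal(x):
    """Analytic signal of a real 1-D array, computed with the FFT.

    Negative-frequency bins are zeroed and positive-frequency bins doubled.
    """
    n = len(x)
    spectrum = np.fft.fft(x)
    weights = np.zeros(n)
    weights[0] = 1.0                      # keep DC as-is
    if n % 2 == 0:
        weights[n // 2] = 1.0             # keep Nyquist bin as-is
        weights[1:n // 2] = 2.0           # double positive frequencies
    else:
        weights[1:(n + 1) // 2] = 2.0
    return np.fft.ifft(spectrum * weights)

def single_band_chimaera(env_source, tfs_source):
    """Combine the ENV of one signal with the TFS of another (one band only)."""
    env = np.abs(analytic_signal(env_source))             # slow envelope
    tfs = np.cos(np.angle(analytic_signal(tfs_source)))   # unit-amplitude carrier
    return env * tfs

# Toy stand-ins for one band of speech (assumed values, not from the paper).
fs = 16000
t = np.arange(fs) / fs                                    # 1 second of samples
am_tone = (1.0 + 0.8 * np.sin(2 * np.pi * 4 * t)) * np.sin(2 * np.pi * 1000 * t)
carrier = np.sin(2 * np.pi * 1500 * t)

# Result carries the 4 Hz envelope of am_tone on the 1500 Hz fine structure.
chimaera = single_band_chimaera(am_tone, carrier)
```

A speech chimaera in the sense of the abstract would repeat this step in each band of a filterbank and sum the band outputs; the number of bands and the signal substituted for the ENV or TFS are design choices not given here.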



