Human Frequency Following Responses to Vocoded Speech: Amplitude Modulation Versus Amplitude Plus Frequency Modulation.

Affiliations

Department of Communication Disorders, California State University, Los Angeles, California, USA.

Department of Speech, Language, and Hearing Sciences, Purdue University, West Lafayette, Indiana, USA.

Publication information

Ear Hear. 2020 Mar/Apr;41(2):300-311. doi: 10.1097/AUD.0000000000000756.

DOI: 10.1097/AUD.0000000000000756
PMID: 31246660

Abstract

OBJECTIVES

The most commonly employed speech processing strategies in cochlear implants (CIs) extract and encode only amplitude modulation (AM) in a limited number of frequency channels. A novel speech processing strategy that encodes both frequency modulation (FM) and AM has been proposed to improve CI performance. Behavioral tests of that strategy were reported to show better speech, speaker, and tone recognition than with the AM-alone strategy. Here, we used scalp-recorded human frequency following responses (FFRs) to examine differences in the neural representation of vocoded speech sounds with AM alone and with AM + FM as the spectral and temporal cues were varied. Specifically, we asked whether adding FM to AM improved the neural representation of envelope periodicity (FFR_ENV) and temporal fine structure (FFR_TFS), as reflected in the temporal pattern of the phase-locked neural activity generating the FFR.
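
To make the AM versus AM + FM distinction concrete, here is a minimal single-channel sketch using only the Python standard library. It is not the vocoder used in the study: the function name `synth_channel`, the 8-kHz sampling rate, the 1000-Hz carrier, and the ±50-Hz frequency glide are all illustrative assumptions. The only values taken from the abstract are the 110-Hz F0 and the 250-ms duration.

```python
import math

def synth_channel(envelope, center_hz, fs, freq_dev=None):
    """Return one vocoder-style channel: a sine carrier scaled by `envelope`.

    With freq_dev=None the carrier stays at center_hz (AM only).
    Otherwise freq_dev[n] (in Hz) is added to the carrier frequency (AM + FM).
    """
    out = []
    phase = 0.0
    for n, amp in enumerate(envelope):
        inst_hz = center_hz + (freq_dev[n] if freq_dev is not None else 0.0)
        phase += 2.0 * math.pi * inst_hz / fs  # integrate frequency to get phase
        out.append(amp * math.sin(phase))
    return out

fs = 8000                    # assumed sampling rate (Hz)
n_samples = int(0.250 * fs)  # 250-ms duration, as in the stimulus
f0 = 110.0                   # envelope periodicity at the stimulus F0

# Raised-cosine envelope fluctuating at F0 (hypothetical envelope shape)
envelope = [0.5 * (1.0 + math.cos(2.0 * math.pi * f0 * n / fs))
            for n in range(n_samples)]
# Hypothetical slow frequency glide of up to 50 Hz around the carrier
glide = [50.0 * math.sin(2.0 * math.pi * 2.0 * n / fs) for n in range(n_samples)]

am_only = synth_channel(envelope, 1000.0, fs)
am_fm = synth_channel(envelope, 1000.0, fs, freq_dev=glide)
```

Both signals share the same envelope. They differ only in whether the carrier frequency is fixed or follows the glide, which is the extra cue an AM + FM strategy would carry.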

DESIGN

FFRs were recorded from 13 normal-hearing adult listeners in response to the original unprocessed stimulus (a synthetic diphthong /au/ with a 110-Hz fundamental frequency, F0, and a 250-ms duration) and to 2-, 4-, 8-, and 16-channel sine-vocoded versions of /au/ with AM alone and with AM + FM. Temporal waveforms, autocorrelation analyses, fast Fourier transforms, and stimulus-response spectral correlations were used to analyze both the strength and fidelity of the neural representation of envelope periodicity (F0) and TFS (formant structure).
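
The autocorrelation and Fourier analyses named above can be sketched on a toy signal. This is a minimal standard-library illustration, not the authors' pipeline: the sampling rate, the 80 to 200 Hz search range, and the synthetic "response" (an F0 component plus a weaker second harmonic) are assumptions, and a real analysis would more likely use an FFT library.

```python
import math

def normalized_autocorr(x, lag):
    """Normalized autocorrelation of x at a given lag (in samples)."""
    mean = sum(x) / len(x)
    centered = [v - mean for v in x]
    num = sum(centered[i] * centered[i + lag] for i in range(len(centered) - lag))
    den = sum(v * v for v in centered)
    return num / den if den else 0.0

def dft_magnitude(x, freq_hz, fs):
    """Magnitude of the discrete Fourier transform of x at one frequency."""
    w = 2.0 * math.pi * freq_hz / fs
    re = sum(v * math.cos(w * n) for n, v in enumerate(x))
    im = sum(v * math.sin(w * n) for n, v in enumerate(x))
    return math.hypot(re, im) / len(x)

fs = 8000                    # assumed sampling rate (Hz)
f0 = 110.0                   # stimulus F0 from the abstract
n_samples = int(0.250 * fs)  # 250-ms duration

# Toy "response": a component at F0 plus a weaker second harmonic
response = [math.sin(2.0 * math.pi * f0 * i / fs)
            + 0.3 * math.sin(2.0 * math.pi * 2.0 * f0 * i / fs)
            for i in range(n_samples)]

# Search candidate periods corresponding to 80-200 Hz
lags = range(int(fs / 200), int(fs / 80) + 1)
best_lag = max(lags, key=lambda lag: normalized_autocorr(response, lag))
estimated_f0 = fs / best_lag                             # periodicity estimate (Hz)
periodicity_strength = normalized_autocorr(response, best_lag)
f0_peak = dft_magnitude(response, f0, fs)                # spectral magnitude at F0
```

The height of the autocorrelation peak serves here as a simple index of periodicity strength, and the DFT magnitude at F0 plays the role of the spectral peak.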

RESULTS

The periodicity strength in the FFR_ENV decreased more for the AM stimuli than for the relatively resilient AM + FM stimuli as the number of channels was increased. Regardless of the number of channels, a clear spectral peak in the FFR_ENV was consistently observed at the stimulus F0 for all AM + FM stimuli but not for the AM stimuli. Neural representation, as revealed by the spectral correlation of the FFR_TFS, was better for the AM + FM stimuli than for the AM stimuli. Neural representation of the time-varying formant-related harmonics, as revealed by the spectral correlation, was also better for the AM + FM stimuli than for the AM stimuli.
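
The abstract does not say how the stimulus-response spectral correlation was computed. One plausible reading is a Pearson correlation between the magnitude spectra of the stimulus and the response. The sketch below assumes that reading and uses toy harmonic signals; the harmonic amplitudes, the 50 to 1000 Hz analysis range, and the names `faithful` and `degraded` are illustrative, not taken from the study.

```python
import math

def magnitude_spectrum(x, fs, freqs_hz):
    """DFT magnitude of x at each frequency in freqs_hz."""
    mags = []
    for f in freqs_hz:
        w = 2.0 * math.pi * f / fs
        re = sum(v * math.cos(w * n) for n, v in enumerate(x))
        im = sum(v * math.sin(w * n) for n, v in enumerate(x))
        mags.append(math.hypot(re, im) / len(x))
    return mags

def pearson(a, b):
    """Pearson correlation coefficient between two equal-length sequences."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    cov = sum((p - ma) * (q - mb) for p, q in zip(a, b))
    var_a = sum((p - ma) ** 2 for p in a)
    var_b = sum((q - mb) ** 2 for q in b)
    return cov / math.sqrt(var_a * var_b)

def harmonic_tone(components, fs, n_samples):
    """Sum of sinusoids given as (frequency_hz, amplitude) pairs."""
    return [sum(a * math.sin(2.0 * math.pi * f * n / fs) for f, a in components)
            for n in range(n_samples)]

fs = 8000                          # assumed sampling rate (Hz)
n_samples = 2000                   # 250 ms at 8 kHz
freqs = list(range(50, 1001, 10))  # assumed analysis range, 10-Hz steps

stimulus = harmonic_tone([(110, 1.0), (220, 0.6), (330, 0.3)], fs, n_samples)
# A response that keeps the stimulus harmonics at slightly reduced amplitude
faithful = harmonic_tone([(110, 0.9), (220, 0.5), (330, 0.25)], fs, n_samples)
# A response that loses the upper harmonics and adds an unrelated component
degraded = harmonic_tone([(110, 0.5), (500, 0.5)], fs, n_samples)

stim_spec = magnitude_spectrum(stimulus, fs, freqs)
r_faithful = pearson(stim_spec, magnitude_spectrum(faithful, fs, freqs))
r_degraded = pearson(stim_spec, magnitude_spectrum(degraded, fs, freqs))
```

Under this measure, a response that preserves the harmonic structure of the stimulus correlates more strongly with it than one that does not, which is the sense in which a higher spectral correlation indicates better neural representation.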

CONCLUSIONS

These results are consistent with previously reported behavioral results and suggest that the AM + FM processing strategy elicited brainstem neural activity that better preserved periodicity, temporal fine structure, and time-varying spectral information than the AM processing strategy. The relatively more robust neural representation of AM + FM stimuli observed here likely contributes to the superior performance on speech, speaker, and tone recognition with the AM + FM processing strategy. Taken together, these results suggest that neural information preserved in the FFR may be used to evaluate signal processing strategies considered for CIs.

Similar articles

1
Human Frequency Following Responses to Vocoded Speech: Amplitude Modulation Versus Amplitude Plus Frequency Modulation.
Ear Hear. 2020 Mar/Apr;41(2):300-311. doi: 10.1097/AUD.0000000000000756.
2
Human Frequency Following Responses to Vocoded Speech.
Ear Hear. 2017 Sep/Oct;38(5):e256-e267. doi: 10.1097/AUD.0000000000000432.
3
Human Frequency Following Responses to Filtered Speech.
Ear Hear. 2021 Jan/Feb;42(1):87-105. doi: 10.1097/AUD.0000000000000902.
4
Human Frequency Following Response: Neural Representation of Envelope and Temporal Fine Structure in Listeners with Normal Hearing and Sensorineural Hearing Loss.
Ear Hear. 2016 Mar-Apr;37(2):e91-e103. doi: 10.1097/AUD.0000000000000247.
5
Effects of Temporal Envelope Cutoff Frequency, Number of Channels, and Carrier Type on Brainstem Neural Representation of Pitch in Vocoded Speech.
J Speech Lang Hear Res. 2022 Aug 17;65(8):3146-3164. doi: 10.1044/2022_JSLHR-21-00576. Epub 2022 Aug 9.
6
Human frequency following responses to iterated rippled noise with positive and negative gain: Differential sensitivity to waveform envelope and temporal fine-structure.
Hear Res. 2018 Sep;367:113-123. doi: 10.1016/j.heares.2018.07.009. Epub 2018 Jul 29.
7
Auditory Brainstem Representation of the Voice Pitch Contours in the Resolved and Unresolved Components of Mandarin Tones.
Front Neurosci. 2018 Nov 16;12:820. doi: 10.3389/fnins.2018.00820. eCollection 2018.
8
The ability of cochlear implant users to use temporal envelope cues recovered from speech frequency modulation.
J Acoust Soc Am. 2012 Aug;132(2):1113-9. doi: 10.1121/1.4726013.
9
Effects of age on F0 discrimination and intonation perception in simulated electric and electroacoustic hearing.
Ear Hear. 2011 Feb;32(1):75-83. doi: 10.1097/AUD.0b013e3181eccfe9.
10
Spectral-Temporal Trade-Off in Vocoded Sentence Recognition: Effects of Age, Hearing Thresholds, and Working Memory.
Ear Hear. 2020 Sep/Oct;41(5):1226-1235. doi: 10.1097/AUD.0000000000000840.