Suppr超能文献

噪声变码与背景噪声下的言语感知:一项 EEG 和行为研究。

Speech Perception with Noise Vocoding and Background Noise: An EEG and Behavioral Study.

机构信息

Biomedical Engineering, Parks College of Engineering, Aviation and Technology, Saint Louis University, 3507 Lindell Blvd, St Louis, MO, 63103, USA.

出版信息

J Assoc Res Otolaryngol. 2021 Jun;22(3):349-363. doi: 10.1007/s10162-021-00787-2. Epub 2021 Apr 13.

Abstract

This study explored the physiological response of the human brain to degraded speech syllables. The degradation was introduced using noise vocoding and/or background noise. The goal was to identify physiological features of auditory-evoked potentials (AEPs) that may explain speech intelligibility. Ten human subjects with normal hearing participated in syllable-detection tasks, while their AEPs were recorded with 32-channel electroencephalography. Subjects were presented with six syllables in the form of consonant-vowel-consonant or vowel-consonant-vowel. Noise vocoding with 22 or 4 frequency channels was applied to the syllables. When examining the peak heights in the AEPs (P1, N1, and P2), vocoding alone showed no consistent effect. P1 was not consistently reduced by background noise, N1 was sometimes reduced by noise, and P2 was almost always highly reduced. Two other physiological metrics were examined: (1) classification accuracy of the syllables based on AEPs, which indicated whether AEPs were distinguishable for different syllables, and (2) cross-condition correlation of AEPs (r) between the clean and degraded speech, which indicated the brain's ability to extract speech-related features and suppress response to noise. Both metrics decreased with degraded speech quality. We further tested if the two metrics can explain cross-subject variations in their behavioral performance. A significant correlation existed for r, as well as classification based on early AEPs, in the fronto-central areas. Because r indicates similarities between clean and degraded speech, our finding suggests that high speech intelligibility may be a result of the brain's ability to ignore noise in the sound carrier and/or background.

摘要

本研究探索了人类大脑对退化语音音节的生理反应。通过噪声声码化和/或背景噪声引入退化。目标是确定听觉诱发电位(AEPs)的生理特征,这些特征可能解释言语可懂度。10 名听力正常的人类受试者参与了音节检测任务,同时记录了他们的 32 通道脑电图的 AEPs。受试者以辅音-元音-辅音或元音-辅音-元音的形式呈现了六个音节。对音节应用了 22 或 4 个频率通道的噪声声码化。在检查 AEPs 中的峰值高度(P1、N1 和 P2)时,单独的声码化没有一致的效果。背景噪声没有一致地降低 P1,噪声有时会降低 N1,而 P2 几乎总是高度降低。还检查了另外两个生理指标:(1)基于 AEPs 的音节分类准确率,这表明 AEPs 是否可以区分不同的音节,以及(2)AEPs 之间的条件间相关性(r)在干净和退化语音之间,这表明大脑提取语音相关特征和抑制对噪声响应的能力。这两个指标都随语音质量的退化而降低。我们进一步测试了这两个指标是否可以解释其行为表现的跨受试者变化。在额中央区域,r 以及基于早期 AEPs 的分类都存在显著相关性。由于 r 表示干净和退化语音之间的相似性,我们的发现表明,高言语可懂度可能是大脑忽略声音载体和/或背景噪声的能力的结果。

相似文献

本文引用的文献

3
Dynamic Encoding of Acoustic Features in Neural Responses to Continuous Speech.对连续语音的神经反应中声学特征的动态编码
J Neurosci. 2017 Feb 22;37(8):2176-2185. doi: 10.1523/JNEUROSCI.2383-16.2017. Epub 2017 Jan 24.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验