噪声变码与背景噪声下的言语感知：一项 EEG 和行为研究。

Speech Perception with Noise Vocoding and Background Noise: An EEG and Behavioral Study.

机构信息

Biomedical Engineering, Parks College of Engineering, Aviation and Technology, Saint Louis University, 3507 Lindell Blvd, St Louis, MO, 63103, USA.

出版信息

J Assoc Res Otolaryngol. 2021 Jun;22(3):349-363. doi: 10.1007/s10162-021-00787-2. Epub 2021 Apr 13.

DOI:10.1007/s10162-021-00787-2

PMID:33851289

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8110670/

Abstract

This study explored the physiological response of the human brain to degraded speech syllables. The degradation was introduced using noise vocoding and/or background noise. The goal was to identify physiological features of auditory-evoked potentials (AEPs) that may explain speech intelligibility. Ten human subjects with normal hearing participated in syllable-detection tasks, while their AEPs were recorded with 32-channel electroencephalography. Subjects were presented with six syllables in the form of consonant-vowel-consonant or vowel-consonant-vowel. Noise vocoding with 22 or 4 frequency channels was applied to the syllables. When examining the peak heights in the AEPs (P1, N1, and P2), vocoding alone showed no consistent effect. P1 was not consistently reduced by background noise, N1 was sometimes reduced by noise, and P2 was almost always highly reduced. Two other physiological metrics were examined: (1) classification accuracy of the syllables based on AEPs, which indicated whether AEPs were distinguishable for different syllables, and (2) cross-condition correlation of AEPs (r) between the clean and degraded speech, which indicated the brain's ability to extract speech-related features and suppress response to noise. Both metrics decreased with degraded speech quality. We further tested if the two metrics can explain cross-subject variations in their behavioral performance. A significant correlation existed for r, as well as classification based on early AEPs, in the fronto-central areas. Because r indicates similarities between clean and degraded speech, our finding suggests that high speech intelligibility may be a result of the brain's ability to ignore noise in the sound carrier and/or background.

摘要

本研究探索了人类大脑对退化语音音节的生理反应。通过噪声声码化和/或背景噪声引入退化。目标是确定听觉诱发电位（AEPs）的生理特征，这些特征可能解释言语可懂度。10 名听力正常的人类受试者参与了音节检测任务，同时记录了他们的 32 通道脑电图的 AEPs。受试者以辅音-元音-辅音或元音-辅音-元音的形式呈现了六个音节。对音节应用了 22 或 4 个频率通道的噪声声码化。在检查 AEPs 中的峰值高度（P1、N1 和 P2）时，单独的声码化没有一致的效果。背景噪声没有一致地降低 P1，噪声有时会降低 N1，而 P2 几乎总是高度降低。还检查了另外两个生理指标：（1）基于 AEPs 的音节分类准确率，这表明 AEPs 是否可以区分不同的音节，以及（2）AEPs 之间的条件间相关性（r）在干净和退化语音之间，这表明大脑提取语音相关特征和抑制对噪声响应的能力。这两个指标都随语音质量的退化而降低。我们进一步测试了这两个指标是否可以解释其行为表现的跨受试者变化。在额中央区域，r 以及基于早期 AEPs 的分类都存在显著相关性。由于 r 表示干净和退化语音之间的相似性，我们的发现表明，高言语可懂度可能是大脑忽略声音载体和/或背景噪声的能力的结果。

相似文献

Speech Perception with Noise Vocoding and Background Noise: An EEG and Behavioral Study.

J Assoc Res Otolaryngol. 2021 Jun;22(3):349-363. doi: 10.1007/s10162-021-00787-2. Epub 2021 Apr 13.

Contribution of spectrotemporal features on auditory event-related potentials elicited by consonant-vowel syllables.

Ear Hear. 2009 Dec;30(6):704-12. doi: 10.1097/AUD.0b013e3181b1d42d.

Temporal encoding of the voice onset time phonetic parameter by field potentials recorded directly from human auditory cortex.

J Neurophysiol. 1999 Nov;82(5):2346-57. doi: 10.1152/jn.1999.82.5.2346.

Effect of Consonant Duration on Formation of Consonant-Vowel Syllable Evoked Auditory Cortical Potentials.

J Int Adv Otol. 2018 Apr;14(1):39-43. doi: 10.5152/iao.2017.3389. Epub 2017 Nov 2.

Varying effect of noise on sound onset and acoustic change evoked auditory cortical N1 responses evoked by a vowel-vowel stimulus.

Int J Psychophysiol. 2020 Jun;152:36-43. doi: 10.1016/j.ijpsycho.2020.04.010. Epub 2020 Apr 14.

Auditory cortical activity to different voice onset times in cochlear implant users.

Clin Neurophysiol. 2016 Feb;127(2):1603-1617. doi: 10.1016/j.clinph.2015.10.049. Epub 2015 Nov 10.

Evoked cortical activity and speech recognition as a function of the number of simulated cochlear implant channels.

Clin Neurophysiol. 2009 Apr;120(4):776-82. doi: 10.1016/j.clinph.2009.01.008. Epub 2009 Feb 27.

Neural indices of phonemic discrimination and sentence-level speech intelligibility in quiet and noise: A P3 study.

Hear Res. 2017 Jul;350:58-67. doi: 10.1016/j.heares.2017.04.009. Epub 2017 Apr 18.

Effects of Signal Type and Noise Background on Auditory Evoked Potential N1, P2, and P3 Measurements in Blast-Exposed Veterans.

Ear Hear. 2021 Jan/Feb;42(1):106-121. doi: 10.1097/AUD.0000000000000906.

Electrophysiological and behavioral measures of some speech contrasts in varied attention and noise.

Hear Res. 2019 Mar 1;373:1-9. doi: 10.1016/j.heares.2018.12.001. Epub 2018 Dec 6.

引用本文的文献

Neural Decoding of the Speech Envelope: Effects of Intelligibility and Spectral Degradation.

Trends Hear. 2024 Jan-Dec;28:23312165241266316. doi: 10.1177/23312165241266316.

Effect of spectral degradation on speech intelligibility and cortical representation.

Front Neurosci. 2024 Apr 5;18:1368641. doi: 10.3389/fnins.2024.1368641. eCollection 2024.

Competing Visual Cues Revealed by Electroencephalography: Sensitivity to Motion Speed and Direction.

Brain Sci. 2024 Feb 4;14(2):160. doi: 10.3390/brainsci14020160.

本文引用的文献

The effect of prior knowledge and intelligibility on the cortical entrainment response to speech.

J Neurophysiol. 2017 Dec 1;118(6):3144-3151. doi: 10.1152/jn.00023.2017. Epub 2017 Sep 6.

Evidence of a speech evoked electrophysiological release from masking in noise.

J Acoust Soc Am. 2017 Aug;142(2):EL218. doi: 10.1121/1.4998151.

Dynamic Encoding of Acoustic Features in Neural Responses to Continuous Speech.

J Neurosci. 2017 Feb 22;37(8):2176-2185. doi: 10.1523/JNEUROSCI.2383-16.2017. Epub 2017 Jan 24.

Effects of acoustic periodicity and intelligibility on the neural oscillations in response to speech.

Neuropsychologia. 2017 Jan 27;95:173-181. doi: 10.1016/j.neuropsychologia.2016.12.003. Epub 2016 Dec 7.

Neural indices of phonemic discrimination and sentence-level speech intelligibility in quiet and noise: A mismatch negativity study.

Hear Res. 2016 Sep;339:40-9. doi: 10.1016/j.heares.2016.06.001. Epub 2016 Jun 4.

Representation of spectro-temporal features of spoken words within the P1-N1-P2 and T-complex of the auditory evoked potentials (AEP).

Neurosci Lett. 2016 Feb 12;614:119-26. doi: 10.1016/j.neulet.2015.12.020. Epub 2015 Dec 14.

Nonlinear feature extraction for objective classification of complex auditory brainstem responses to diotic perceptually critical consonant-vowel syllables.

Auris Nasus Larynx. 2016 Feb;43(1):37-44. doi: 10.1016/j.anl.2015.06.003. Epub 2015 Aug 22.

Cortical characterization of the perception of intelligible and unintelligible speech measured via high-density electroencephalography.

Brain Lang. 2015 Jan;140:49-54. doi: 10.1016/j.bandl.2014.10.008. Epub 2014 Dec 13.

Direct classification of all American English phonemes using signals from functional speech motor cortex.

J Neural Eng. 2014 Jun;11(3):035015. doi: 10.1088/1741-2560/11/3/035015. Epub 2014 May 19.

EEG classification in a single-trial basis for vowel speech perception using multivariate empirical mode decomposition.

J Neural Eng. 2014 Jun;11(3):036010. doi: 10.1088/1741-2560/11/3/036010. Epub 2014 May 8.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

噪声变码与背景噪声下的言语感知：一项 EEG 和行为研究。

Speech Perception with Noise Vocoding and Background Noise: An EEG and Behavioral Study.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献