在同时出现的语音的皮质分离过程中，噪声和音高相互作用。

Noise and pitch interact during the cortical segregation of concurrent speech.

作者信息

Bidelman Gavin M, Yellamsetty Anusha

机构信息

School of Communication Sciences & Disorders, University of Memphis, Memphis, TN, 38152, USA; Institute for Intelligent Systems, University of Memphis, Memphis, TN, 38152, USA; Univeristy of Tennessee Health Sciences Center, Department of Anatomy and Neurobiology, Memphis, TN, 38163, USA.

School of Communication Sciences & Disorders, University of Memphis, Memphis, TN, 38152, USA.

出版信息

Hear Res. 2017 Aug;351:34-44. doi: 10.1016/j.heares.2017.05.008. Epub 2017 May 25.

DOI:10.1016/j.heares.2017.05.008

PMID:28578876

Abstract

Behavioral studies reveal listeners exploit intrinsic differences in voice fundamental frequency (F0) to segregate concurrent speech sounds-the so-called "F0-benefit." More favorable signal-to-noise ratio (SNR) in the environment, an extrinsic acoustic factor, similarly benefits the parsing of simultaneous speech. Here, we examined the neurobiological substrates of these two cues in the perceptual segregation of concurrent speech mixtures. We recorded event-related brain potentials (ERPs) while listeners performed a speeded double-vowel identification task. Listeners heard two concurrent vowels whose F0 differed by zero or four semitones presented in either clean (no noise) or noise-degraded (+5 dB SNR) conditions. Behaviorally, listeners were more accurate in correctly identifying both vowels for larger F0 separations but F0-benefit was more pronounced at more favorable SNRs (i.e., pitch × SNR interaction). Analysis of the ERPs revealed that only the P2 wave (∼200 ms) showed a similar F0 x SNR interaction as behavior and was correlated with listeners' perceptual F0-benefit. Neural classifiers applied to the ERPs further suggested that speech sounds are segregated neurally within 200 ms based on SNR whereas segregation based on pitch occurs later in time (400-700 ms). The earlier timing of extrinsic SNR compared to intrinsic F0-based segregation implies that the cortical extraction of speech from noise is more efficient than differentiating speech based on pitch cues alone, which may recruit additional cortical processes. Findings indicate that noise and pitch differences interact relatively early in cerebral cortex and that the brain arrives at the identities of concurrent speech mixtures as early as ∼200 ms.

摘要

行为研究表明，听众利用语音基频（F0）的内在差异来分离同时出现的语音——即所谓的“F0优势”。环境中更有利的信噪比（SNR），这一外在声学因素，同样有助于同时出现的语音的解析。在此，我们研究了这两种线索在同时出现的语音混合体的感知分离中的神经生物学基础。我们记录了事件相关脑电位（ERP），同时让听众执行一项快速双元音识别任务。听众听到两个同时出现的元音，其F0相差零或四个半音，呈现于干净（无噪声）或噪声退化（+5 dB SNR）条件下。在行为上，对于更大的F0间隔，听众更准确地正确识别两个元音，但F0优势在更有利的SNR下更为明显（即音高×SNR交互作用）。对ERP的分析表明，只有P2波（约200毫秒）表现出与行为相似的F0×SNR交互作用，并且与听众的感知F0优势相关。应用于ERP的神经分类器进一步表明，语音在200毫秒内基于SNR在神经上被分离，而基于音高的分离发生在更晚的时间（400 - 700毫秒）。与基于内在F0的分离相比，外在SNR的更早时间表明，从噪声中提取语音的皮层效率高于仅基于音高线索区分语音，后者可能需要额外的皮层过程。研究结果表明，噪声和音高差异在大脑皮层中相对较早地相互作用，并且大脑早在约200毫秒时就能确定同时出现的语音混合体的身份。

相似文献

Noise and pitch interact during the cortical segregation of concurrent speech.

Hear Res. 2017 Aug;351:34-44. doi: 10.1016/j.heares.2017.05.008. Epub 2017 May 25.

Low- and high-frequency cortical brain oscillations reflect dissociable mechanisms of concurrent speech segregation in noise.

Hear Res. 2018 Apr;361:92-102. doi: 10.1016/j.heares.2018.01.006. Epub 2018 Feb 2.

Brainstem correlates of concurrent speech identification in adverse listening conditions.

Brain Res. 2019 Jul 1;1714:182-192. doi: 10.1016/j.brainres.2019.02.025. Epub 2019 Feb 20.

Language experience-dependent advantage in pitch representation in the auditory cortex is limited to favorable signal-to-noise ratios.

Hear Res. 2017 Nov;355:42-53. doi: 10.1016/j.heares.2017.09.006. Epub 2017 Sep 14.

Effect of competing noise on cortical auditory evoked potentials elicited by speech sounds in 7- to 25-year-old listeners.

Hear Res. 2019 Mar 1;373:103-112. doi: 10.1016/j.heares.2019.01.004. Epub 2019 Jan 9.

Enhanced speech perception in noise and cortical auditory evoked potentials in professional musicians.

Int J Audiol. 2018 Jan;57(1):40-52. doi: 10.1080/14992027.2017.1380850. Epub 2017 Oct 3.

Voice segregation by difference in fundamental frequency: effect of masker type.

J Acoust Soc Am. 2013 Nov;134(5):EL465-70. doi: 10.1121/1.4826152.

Sequential stream segregation of voiced and unvoiced speech sounds based on fundamental frequency.

Hear Res. 2017 Feb;344:235-243. doi: 10.1016/j.heares.2016.11.016. Epub 2016 Dec 5.

Noise tolerance in human frequency-following responses to voice pitch.

J Acoust Soc Am. 2011 Jan;129(1):EL21-6. doi: 10.1121/1.3528775.

The relative importance of spectral cues for vowel recognition in severe noise.

J Acoust Soc Am. 2012 Oct;132(4):2652-62. doi: 10.1121/1.4751543.

引用本文的文献

Neural correlates of phonetic categorization under auditory (phoneme) and visual (grapheme) modalities.

Neuroscience. 2025 Jan 26;565:182-191. doi: 10.1016/j.neuroscience.2024.11.079. Epub 2024 Dec 2.

The Effect of Simultaneous Contralateral White Noise Masking on Cortical Auditory Evoked Potentials Elicited by Speech Stimuli.

Int Arch Otorhinolaryngol. 2024 Feb 5;28(1):e115-e121. doi: 10.1055/s-0043-1767675. eCollection 2024 Jan.

Short- and long-term neuroplasticity interact during the perceptual learning of concurrent speech.

Cereb Cortex. 2024 Jan 31;34(2). doi: 10.1093/cercor/bhad543.

Decoding Hearing-Related Changes in Older Adults' Spatiotemporal Neural Processing of Speech Using Machine Learning.

Front Neurosci. 2020 Jul 16;14:748. doi: 10.3389/fnins.2020.00748. eCollection 2020.

Brief Report: Speech-in-Noise Recognition and the Relation to Vocal Pitch Perception in Adults with Autism Spectrum Disorder and Typical Development.

J Autism Dev Disord. 2020 Jan;50(1):356-363. doi: 10.1007/s10803-019-04244-1.

Afferent-efferent connectivity between auditory brainstem and cortex accounts for poorer speech-in-noise comprehension in older adults.

Hear Res. 2019 Oct;382:107795. doi: 10.1016/j.heares.2019.107795. Epub 2019 Aug 27.

Acoustic noise and vision differentially warp the auditory categorization of speech.

J Acoust Soc Am. 2019 Jul;146(1):60. doi: 10.1121/1.5114822.

Psychobiological Responses Reveal Audiovisual Noise Differentially Challenges Speech Recognition.

Ear Hear. 2020 Mar/Apr;41(2):268-277. doi: 10.1097/AUD.0000000000000755.

Brainstem correlates of concurrent speech identification in adverse listening conditions.

Brain Res. 2019 Jul 1;1714:182-192. doi: 10.1016/j.brainres.2019.02.025. Epub 2019 Feb 20.

Neural Correlates of Enhanced Audiovisual Processing in the Bilingual Brain.

Neuroscience. 2019 Mar 1;401:11-20. doi: 10.1016/j.neuroscience.2019.01.003. Epub 2019 Jan 9.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

在同时出现的语音的皮质分离过程中，噪声和音高相互作用。

Noise and pitch interact during the cortical segregation of concurrent speech.

作者信息

Bidelman Gavin M, Yellamsetty Anusha

机构信息

School of Communication Sciences & Disorders, University of Memphis, Memphis, TN, 38152, USA.

出版信息

Hear Res. 2017 Aug;351:34-44. doi: 10.1016/j.heares.2017.05.008. Epub 2017 May 25.

DOI:10.1016/j.heares.2017.05.008

PMID:28578876

Abstract

摘要

在同时出现的语音的皮质分离过程中，噪声和音高相互作用。

Noise and pitch interact during the cortical segregation of concurrent speech.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

在同时出现的语音的皮质分离过程中，噪声和音高相互作用。

Noise and pitch interact during the cortical segregation of concurrent speech.

作者信息

机构信息

出版信息

相似文献

引用本文的文献