在存在多个说话者的情况下，对语音音高的皮层追踪取决于选择性注意。

Cortical tracking of voice pitch in the presence of multiple speakers depends on selective attention.

作者信息

Brodbeck Christian, Simon Jonathan Z

机构信息

Department of Psychological Sciences, University of Connecticut, Storrs, CT, United States.

Institute for Systems Research, University of Maryland, College Park, College Park, MD, United States.

出版信息

Front Neurosci. 2022 Aug 8;16:828546. doi: 10.3389/fnins.2022.828546. eCollection 2022.

DOI:10.3389/fnins.2022.828546

PMID:36003957

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9393379/

Abstract

Voice pitch carries linguistic and non-linguistic information. Previous studies have described cortical tracking of voice pitch in clean speech, with responses reflecting both pitch strength and pitch value. However, pitch is also a powerful cue for auditory stream segregation, especially when competing streams have pitch differing in fundamental frequency, as is the case when multiple speakers talk simultaneously. We therefore investigated how cortical speech pitch tracking is affected in the presence of a second, task-irrelevant speaker. We analyzed human magnetoencephalography (MEG) responses to continuous narrative speech, presented either as a single talker in a quiet background or as a two-talker mixture of a male and a female speaker. In clean speech, voice pitch was associated with a right-dominant response, peaking at a latency of around 100 ms, consistent with previous electroencephalography and electrocorticography results. The response tracked both the presence of pitch and the relative value of the speaker's fundamental frequency. In the two-talker mixture, the pitch of the attended speaker was tracked bilaterally, regardless of whether or not there was simultaneously present pitch in the speech of the irrelevant speaker. Pitch tracking for the irrelevant speaker was reduced: only the right hemisphere still significantly tracked pitch of the unattended speaker, and only during intervals in which no pitch was present in the attended talker's speech. Taken together, these results suggest that pitch-based segregation of multiple speakers, at least as measured by macroscopic cortical tracking, is not entirely automatic but strongly dependent on selective attention.

摘要

音高承载着语言和非语言信息。先前的研究描述了在纯净语音中大脑皮层对音高的追踪，其反应反映了音高强度和音高值。然而，音高也是听觉流分离的一个有力线索，特别是当竞争流的音高在基频上不同时，就像多个说话者同时交谈的情况。因此，我们研究了在存在第二个与任务无关的说话者的情况下，大脑皮层对语音音高的追踪是如何受到影响的。我们分析了人类脑磁图（MEG）对连续叙述性语音的反应，语音呈现方式要么是在安静背景中的单个说话者，要么是男性和女性说话者的双说话者混合语音。在纯净语音中，音高与右侧优势反应相关，在大约100毫秒的潜伏期达到峰值，这与先前的脑电图和皮层脑电图结果一致。该反应追踪了音高的存在以及说话者基频的相对值。在双说话者混合语音中，被关注说话者的音高在双侧都被追踪，无论无关说话者的语音中是否同时存在音高。对无关说话者的音高追踪减少：只有右半球仍能显著追踪未被关注说话者的音高，并且仅在被关注说话者的语音中不存在音高的时间段内。综上所述，这些结果表明，至少通过宏观皮层追踪测量的基于音高的多个说话者分离并非完全自动，而是强烈依赖于选择性注意。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/51b5/9393379/e719c43a193c/fnins-16-828546-g001.jpg

相似文献

Cortical tracking of voice pitch in the presence of multiple speakers depends on selective attention.

Front Neurosci. 2022 Aug 8;16:828546. doi: 10.3389/fnins.2022.828546. eCollection 2022.

Joint population coding and temporal coherence link an attended talker's voice and location features in naturalistic multi-talker scenes.

bioRxiv. 2025 Feb 12:2024.05.13.593814. doi: 10.1101/2024.05.13.593814.

Cortical responses time-locked to continuous speech in the high-gamma band depend on selective attention.

Front Neurosci. 2023 Dec 14;17:1264453. doi: 10.3389/fnins.2023.1264453. eCollection 2023.

Cortical Responses Time-Locked to Continuous Speech in the High-Gamma Band Depend on Selective Attention.

bioRxiv. 2023 Oct 15:2023.07.20.549567. doi: 10.1101/2023.07.20.549567.

Linguistic processing of task-irrelevant speech at a cocktail party.

Elife. 2021 May 4;10:e65096. doi: 10.7554/eLife.65096.

Attentional Modulation of the Cortical Contribution to the Frequency-Following Response Evoked by Continuous Speech.

J Neurosci. 2023 Nov 1;43(44):7429-7440. doi: 10.1523/JNEUROSCI.1247-23.2023. Epub 2023 Oct 4.

Left Superior Temporal Gyrus Is Coupled to Attended Speech in a Cocktail-Party Auditory Scene.

J Neurosci. 2016 Feb 3;36(5):1596-606. doi: 10.1523/JNEUROSCI.1730-15.2016.

The Right Temporoparietal Junction Supports Speech Tracking During Selective Listening: Evidence from Concurrent EEG-fMRI.

J Neurosci. 2017 Nov 22;37(47):11505-11516. doi: 10.1523/JNEUROSCI.1007-17.2017. Epub 2017 Oct 23.

Cortical Tracking of Speech-in-Noise Develops from Childhood to Adulthood.

J Neurosci. 2019 Apr 10;39(15):2938-2950. doi: 10.1523/JNEUROSCI.1732-18.2019. Epub 2019 Feb 11.

Perception of pitch location within a speaker's range: fundamental frequency, voice quality and speaker sex.

J Acoust Soc Am. 2012 Aug;132(2):1100-12. doi: 10.1121/1.4714351.

引用本文的文献

Linear modeling of brain activity during selective attention to continuous speech: the critical role of the N1 effect in event-related potentials to acoustic edges.

Cogn Neurodyn. 2025 Dec;19(1):110. doi: 10.1007/s11571-025-10289-z. Epub 2025 Jul 2.

Neural speech tracking in a virtual acoustic environment: audio-visual benefit for unscripted continuous speech.

Front Hum Neurosci. 2025 Apr 9;19:1560558. doi: 10.3389/fnhum.2025.1560558. eCollection 2025.

Dynamics of Pitch Perception in the Auditory Cortex.

J Neurosci. 2025 Mar 19;45(12):e1111242025. doi: 10.1523/JNEUROSCI.1111-24.2025.

EEG-based cross-subject passive music pitch perception using deep learning models.

Cogn Neurodyn. 2025 Dec;19(1):6. doi: 10.1007/s11571-024-10196-9. Epub 2025 Jan 3.

Neural encoding of melodic expectations in music across EEG frequency bands.

Eur J Neurosci. 2024 Dec;60(11):6734-6749. doi: 10.1111/ejn.16581. Epub 2024 Oct 29.

Episodic long-term memory formation during slow-wave sleep.

Elife. 2024 Apr 25;12:RP89601. doi: 10.7554/eLife.89601.

Attentional Modulation of the Cortical Contribution to the Frequency-Following Response Evoked by Continuous Speech.

J Neurosci. 2023 Nov 1;43(44):7429-7440. doi: 10.1523/JNEUROSCI.1247-23.2023. Epub 2023 Oct 4.

Hemispheric asymmetries for music and speech: Spectrotemporal modulations and top-down influences.

Front Neurosci. 2022 Dec 20;16:1075511. doi: 10.3389/fnins.2022.1075511. eCollection 2022.

Cocktail party training induces increased speech intelligibility and decreased cortical activity in bilateral inferior frontal gyri. A functional near-infrared study.

PLoS One. 2022 Dec 1;17(12):e0277801. doi: 10.1371/journal.pone.0277801. eCollection 2022.

本文引用的文献

Eelbrain, a Python toolkit for time-continuous analysis with temporal response functions.

Elife. 2023 Nov 29;12:e85012. doi: 10.7554/eLife.85012.

Neural Markers of Speech Comprehension: Measuring EEG Tracking of Linguistic Speech Representations, Controlling the Speech Acoustics.

J Neurosci. 2021 Dec 15;41(50):10316-10329. doi: 10.1523/JNEUROSCI.0812-21.2021. Epub 2021 Nov 3.

The neural processing of pitch accents in continuous speech.

Neuropsychologia. 2021 Jul 30;158:107883. doi: 10.1016/j.neuropsychologia.2021.107883. Epub 2021 May 11.

Enhanced Neural Tracking of the Fundamental Frequency of the Voice.

IEEE Trans Biomed Eng. 2021 Dec;68(12):3612-3619. doi: 10.1109/TBME.2021.3080123. Epub 2021 Nov 19.

Neural tracking of the fundamental frequency of the voice: The effect of voice characteristics.

Eur J Neurosci. 2021 Jun;53(11):3640-3653. doi: 10.1111/ejn.15229. Epub 2021 Apr 27.

Inhibitory effect of tDCS on auditory evoked response: Simultaneous MEG-tDCS reveals causal role of right auditory cortex in pitch learning.

Neuroimage. 2021 Jun;233:117915. doi: 10.1016/j.neuroimage.2021.117915. Epub 2021 Feb 27.

Human cortical encoding of pitch in tonal and non-tonal languages.

Nat Commun. 2021 Feb 19;12(1):1161. doi: 10.1038/s41467-021-21430-x.

Early cortical processing of pitch height and the role of adaptation and musicality.

Neuroimage. 2021 Jan 15;225:117501. doi: 10.1016/j.neuroimage.2020.117501. Epub 2020 Oct 24.

Neural speech restoration at the cocktail party: Auditory cortex recovers masked speech of both attended and ignored speakers.

PLoS Biol. 2020 Oct 22;18(10):e3000883. doi: 10.1371/journal.pbio.3000883. eCollection 2020 Oct.

High gamma cortical processing of continuous speech in younger and older listeners.

Neuroimage. 2020 Nov 15;222:117291. doi: 10.1016/j.neuroimage.2020.117291. Epub 2020 Aug 21.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

在存在多个说话者的情况下，对语音音高的皮层追踪取决于选择性注意。

Cortical tracking of voice pitch in the presence of multiple speakers depends on selective attention.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献