• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

选择性皮层对多说话人语音感知中被注意说话人的代表。

Selective cortical representation of attended speaker in multi-talker speech perception.

机构信息

Departments of Neurological Surgery and Physiology, UCSF Center for Integrative Neuroscience, University of California, San Francisco, California 94143, USA.

出版信息

Nature. 2012 May 10;485(7397):233-6. doi: 10.1038/nature11020.

DOI:10.1038/nature11020
PMID:22522927
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3870007/
Abstract

Humans possess a remarkable ability to attend to a single speaker's voice in a multi-talker background. How the auditory system manages to extract intelligible speech under such acoustically complex and adverse listening conditions is not known, and, indeed, it is not clear how attended speech is internally represented. Here, using multi-electrode surface recordings from the cortex of subjects engaged in a listening task with two simultaneous speakers, we demonstrate that population responses in non-primary human auditory cortex encode critical features of attended speech: speech spectrograms reconstructed based on cortical responses to the mixture of speakers reveal the salient spectral and temporal features of the attended speaker, as if subjects were listening to that speaker alone. A simple classifier trained solely on examples of single speakers can decode both attended words and speaker identity. We find that task performance is well predicted by a rapid increase in attention-modulated neural selectivity across both single-electrode and population-level cortical responses. These findings demonstrate that the cortical representation of speech does not merely reflect the external acoustic environment, but instead gives rise to the perceptual aspects relevant for the listener's intended goal.

摘要

人类拥有在多说话者背景下专注于单个说话者声音的非凡能力。听觉系统如何在如此复杂和不利的听觉条件下设法提取可理解的语音尚不清楚,实际上,也不清楚被关注的语音是如何在内部表示的。在这里,我们使用来自参与两个同时说话者的聆听任务的受试者的皮层的多电极表面记录,证明非主要人类听觉皮层中的群体反应编码了被关注的语音的关键特征:基于对混合说话者的皮层反应重建的语音频谱图揭示了被关注的说话者的显著频谱和时间特征,就好像受试者只是在听那个说话者。仅在单个说话者的示例上进行训练的简单分类器可以解码被关注的单词和说话者身份。我们发现,任务表现可以很好地由单电极和群体水平皮层反应中的注意力调节神经选择性的快速增加来预测。这些发现表明,语音的皮层表示不仅仅反映了外部声学环境,而是产生了与听众预期目标相关的感知方面。

相似文献

1
Selective cortical representation of attended speaker in multi-talker speech perception.选择性皮层对多说话人语音感知中被注意说话人的代表。
Nature. 2012 May 10;485(7397):233-6. doi: 10.1038/nature11020.
2
Noise-robust cortical tracking of attended speech in real-world acoustic scenes.在真实声学场景中对注意到的语音进行抗噪皮层追踪。
Neuroimage. 2017 Aug 1;156:435-444. doi: 10.1016/j.neuroimage.2017.04.026. Epub 2017 Apr 13.
3
Hierarchical Encoding of Attended Auditory Objects in Multi-talker Speech Perception.多说话人语音感知中被注意听觉对象的分层编码。
Neuron. 2019 Dec 18;104(6):1195-1209.e3. doi: 10.1016/j.neuron.2019.09.007. Epub 2019 Oct 21.
4
Neural decoding of attentional selection in multi-speaker environments without access to clean sources.多说话人环境中无法访问干净源时的注意力选择的神经解码。
J Neural Eng. 2017 Oct;14(5):056001. doi: 10.1088/1741-2552/aa7ab4. Epub 2017 Aug 4.
5
Task-dependent decoding of speaker and vowel identity from auditory cortical response patterns.基于听觉皮层反应模式的说话人及元音身份的任务相关解码
J Neurosci. 2014 Mar 26;34(13):4548-57. doi: 10.1523/JNEUROSCI.4339-13.2014.
6
Left Superior Temporal Gyrus Is Coupled to Attended Speech in a Cocktail-Party Auditory Scene.左颞上回在鸡尾酒会听觉场景中与被关注的语音相关联。
J Neurosci. 2016 Feb 3;36(5):1596-606. doi: 10.1523/JNEUROSCI.1730-15.2016.
7
EEG-based auditory attention detection: boundary conditions for background noise and speaker positions.基于脑电图的听觉注意力检测:背景噪声和说话人位置的边界条件。
J Neural Eng. 2018 Dec;15(6):066017. doi: 10.1088/1741-2552/aae0a6. Epub 2018 Sep 12.
8
Joint population coding and temporal coherence link an attended talker's voice and location features in naturalistic multi-talker scenes.在自然主义的多说话者场景中,联合群体编码和时间连贯性将被关注说话者的语音和位置特征联系起来。
bioRxiv. 2025 Feb 12:2024.05.13.593814. doi: 10.1101/2024.05.13.593814.
9
Improved Decoding of Attentional Selection in Multi-Talker Environments with Self-Supervised Learned Speech Representation.利用自监督学习的语音表征改进多说话者环境中注意力选择的解码
Annu Int Conf IEEE Eng Med Biol Soc. 2023 Jul;2023:1-5. doi: 10.1109/EMBC40787.2023.10340191.
10
Cortical processing of distracting speech in noisy auditory scenes depends on perceptual demand.听觉场景中分散性言语的皮质处理依赖于感知需求。
Neuroimage. 2021 Mar;228:117670. doi: 10.1016/j.neuroimage.2020.117670. Epub 2020 Dec 24.

引用本文的文献

1
Burnout and the Brain-A Mechanistic Review of Magnetic Resonance Imaging (MRI) Studies.职业倦怠与大脑——磁共振成像(MRI)研究的机制综述
Int J Mol Sci. 2025 Aug 28;26(17):8379. doi: 10.3390/ijms26178379.
2
Dynamic representation of sound locations during task engagement in marmoset auditory cortex.狨猴听觉皮层在任务参与过程中声音位置的动态表征。
bioRxiv. 2025 Aug 19:2025.08.14.669832. doi: 10.1101/2025.08.14.669832.
3
Neural speech tracking in noise reflects the opposing influence of SNR on intelligibility and attentional effort.

本文引用的文献

1
Reconstructing speech from human auditory cortex.从人类听觉皮层重建语音。
PLoS Biol. 2012 Jan;10(1):e1001251. doi: 10.1371/journal.pbio.1001251. Epub 2012 Jan 31.
2
Tuning of the human neocortex to the temporal dynamics of attended events.人类新皮层对关注事件的时间动态的调整。
J Neurosci. 2011 Mar 2;31(9):3176-85. doi: 10.1523/JNEUROSCI.4518-10.2011.
3
Auditory grouping.听觉分组。
噪声环境下的神经语音跟踪反映了信噪比在可懂度和注意力投入方面的相反影响。
Imaging Neurosci (Camb). 2025 Aug 28;3. doi: 10.1162/IMAG.a.126. eCollection 2025.
4
Exploring an EM-algorithm for banded regression in computational neuroscience.探索计算神经科学中带状回归的期望最大化算法。
Imaging Neurosci (Camb). 2024 May 20;2. doi: 10.1162/imag_a_00155. eCollection 2024.
5
Bridging verbal coordination and neural dynamics.架起言语协调与神经动力学之间的桥梁。
Elife. 2025 Aug 6;13:RP99547. doi: 10.7554/eLife.99547.
6
Individual Noise-Tolerance Profiles and Neural Signal-to-Noise Ratio: Insights into Predicting Speech-in-Noise Performance and Noise-Reduction Outcomes.个体噪声耐受曲线与神经信噪比:对预测噪声环境下言语表现及降噪效果的见解
Audiol Res. 2025 Jul 2;15(4):78. doi: 10.3390/audiolres15040078.
7
Reduced Neural Speech Tracking in Adolescents with Listening Difficulty.听力困难青少年的神经语音跟踪能力下降。
medRxiv. 2025 Jun 24:2025.06.24.25330187. doi: 10.1101/2025.06.24.25330187.
8
Optimized feature gains explain and predict successes and failures of human selective listening.优化后的特征增益能够解释并预测人类选择性听力的成败。
bioRxiv. 2025 May 28:2025.05.28.656682. doi: 10.1101/2025.05.28.656682.
9
Neural Speech Tracking during Selective Attention: A Spatially Realistic Audiovisual Study.选择性注意期间的神经语音追踪:一项空间逼真的视听研究。
eNeuro. 2025 Jun 24;12(6). doi: 10.1523/ENEURO.0132-24.2025. Print 2025 Jun.
10
Le Petit Prince (LPP) multi-talker: Naturalistic 7 T fMRI and EEG dataset.《小王子》多说话者:自然主义7T功能磁共振成像和脑电图数据集
Sci Data. 2025 May 20;12(1):829. doi: 10.1038/s41597-025-05158-7.
Trends Cogn Sci. 1997 Dec;1(9):327-33. doi: 10.1016/S1364-6613(97)01097-8.
4
Temporal coherence and attention in auditory scene analysis.听觉场景分析中的时间连贯性和注意力。
Trends Neurosci. 2011 Mar;34(3):114-23. doi: 10.1016/j.tins.2010.11.002. Epub 2010 Dec 31.
5
Categorical speech representation in human superior temporal gyrus.人类上颞叶中的范畴言语表征。
Nat Neurosci. 2010 Nov;13(11):1428-32. doi: 10.1038/nn.2641. Epub 2010 Oct 3.
6
Information flow in the auditory cortical network.听觉皮层网络中的信息流。
Hear Res. 2011 Jan;271(1-2):133-46. doi: 10.1016/j.heares.2010.01.011. Epub 2010 Jan 29.
7
Attentional gain control of ongoing cortical speech representations in a "cocktail party".鸡尾酒会中的持续皮质言语表征的注意增益控制
J Neurosci. 2010 Jan 13;30(2):620-8. doi: 10.1523/JNEUROSCI.3631-09.2010.
8
Influence of context and behavior on stimulus reconstruction from neural activity in primary auditory cortex.初级听觉皮层神经活动中刺激重构的上下文和行为影响。
J Neurophysiol. 2009 Dec;102(6):3329-39. doi: 10.1152/jn.91128.2008. Epub 2009 Sep 16.
9
Interaction between attention and bottom-up saliency mediates the representation of foreground and background in an auditory scene.注意力与自下而上的显著性之间的相互作用介导了听觉场景中前景和背景的表征。
PLoS Biol. 2009 Jun;7(6):e1000129. doi: 10.1371/journal.pbio.1000129. Epub 2009 Jun 16.
10
The neural processing of masked speech: evidence for different mechanisms in the left and right temporal lobes.掩蔽言语的神经处理:左右颞叶不同机制的证据。
J Acoust Soc Am. 2009 Mar;125(3):1737-43. doi: 10.1121/1.3050255.