• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多说话人环境中分层言语表征的注意调制。

Attentional Modulation of Hierarchical Speech Representations in a Multitalker Environment.

机构信息

Neuroscience Program, Sabuncu Brain Research Center, Bilkent University, Ankara TR-06800, Turkey.

National Magnetic Resonance Research Center (UMRAM), Bilkent University, Ankara TR-06800, Turkey.

出版信息

Cereb Cortex. 2021 Oct 1;31(11):4986-5005. doi: 10.1093/cercor/bhab136.

DOI:10.1093/cercor/bhab136
PMID:34115102
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8491717/
Abstract

Humans are remarkably adept in listening to a desired speaker in a crowded environment, while filtering out nontarget speakers in the background. Attention is key to solving this difficult cocktail-party task, yet a detailed characterization of attentional effects on speech representations is lacking. It remains unclear across what levels of speech features and how much attentional modulation occurs in each brain area during the cocktail-party task. To address these questions, we recorded whole-brain blood-oxygen-level-dependent (BOLD) responses while subjects either passively listened to single-speaker stories, or selectively attended to a male or a female speaker in temporally overlaid stories in separate experiments. Spectral, articulatory, and semantic models of the natural stories were constructed. Intrinsic selectivity profiles were identified via voxelwise models fit to passive listening responses. Attentional modulations were then quantified based on model predictions for attended and unattended stories in the cocktail-party task. We find that attention causes broad modulations at multiple levels of speech representations while growing stronger toward later stages of processing, and that unattended speech is represented up to the semantic level in parabelt auditory cortex. These results provide insights on attentional mechanisms that underlie the ability to selectively listen to a desired speaker in noisy multispeaker environments.

摘要

人类在嘈杂的环境中倾听目标说话者的能力非常出色,同时可以过滤背景中的非目标说话者。注意力是解决这个困难的鸡尾酒会任务的关键,但注意力对语音表示的影响的详细特征描述还很缺乏。在鸡尾酒会任务中,大脑区域在什么层次的语音特征上发生了多少注意力调制,这一点还不清楚。为了解决这些问题,我们在被试者被动地听单个说话者的故事,或者在分别的实验中选择性地注意到时间上重叠的故事中的男性或女性说话者时,记录了全脑血氧水平依赖(BOLD)反应。构建了自然故事的频谱、发音和语义模型。通过对被动聆听反应进行拟合的体素模型,确定了内在选择性轮廓。然后,根据鸡尾酒会任务中对注意和未注意故事的模型预测,量化了注意力调制。我们发现,注意力引起了多个语音表示层次的广泛调制,并且随着处理阶段的推进,调制变得越来越强,未注意的语音在副听觉皮层中一直被表示到语义水平。这些结果为选择性地在嘈杂的多说话者环境中倾听目标说话者的注意力机制提供了深入的了解。

相似文献

1
Attentional Modulation of Hierarchical Speech Representations in a Multitalker Environment.多说话人环境中分层言语表征的注意调制。
Cereb Cortex. 2021 Oct 1;31(11):4986-5005. doi: 10.1093/cercor/bhab136.
2
Left Superior Temporal Gyrus Is Coupled to Attended Speech in a Cocktail-Party Auditory Scene.左颞上回在鸡尾酒会听觉场景中与被关注的语音相关联。
J Neurosci. 2016 Feb 3;36(5):1596-606. doi: 10.1523/JNEUROSCI.1730-15.2016.
3
Attention Differentially Affects Acoustic and Phonetic Feature Encoding in a Multispeaker Environment.注意在多说话人环境中对声学和语音特征编码的影响不同。
J Neurosci. 2022 Jan 26;42(4):682-691. doi: 10.1523/JNEUROSCI.1455-20.2021. Epub 2021 Dec 10.
4
Cortical Representations of Speech in a Multitalker Auditory Scene.多说话者听觉场景中语音的皮质表征
J Neurosci. 2017 Sep 20;37(38):9189-9196. doi: 10.1523/JNEUROSCI.0938-17.2017. Epub 2017 Aug 18.
5
Breaking down the cocktail party: Attentional modulation of cerebral audiovisual speech processing.解析鸡尾酒会效应:注意力对大脑视听言语加工的调制。
Neuroimage. 2021 Jan 1;224:117365. doi: 10.1016/j.neuroimage.2020.117365. Epub 2020 Sep 14.
6
Hierarchical Encoding of Attended Auditory Objects in Multi-talker Speech Perception.多说话人语音感知中被注意听觉对象的分层编码。
Neuron. 2019 Dec 18;104(6):1195-1209.e3. doi: 10.1016/j.neuron.2019.09.007. Epub 2019 Oct 21.
7
Attentional gain control of ongoing cortical speech representations in a "cocktail party".鸡尾酒会中的持续皮质言语表征的注意增益控制
J Neurosci. 2010 Jan 13;30(2):620-8. doi: 10.1523/JNEUROSCI.3631-09.2010.
8
Inferring Mechanisms of Auditory Attentional Modulation with Deep Neural Networks.基于深度神经网络推断听觉注意力调制的机制
Neural Comput. 2022 Oct 7;34(11):2273-2293. doi: 10.1162/neco_a_01537.
9
The Right Temporoparietal Junction Supports Speech Tracking During Selective Listening: Evidence from Concurrent EEG-fMRI.右侧颞顶联合区在选择性倾听过程中支持言语追踪:来自同步脑电图-功能磁共振成像的证据。
J Neurosci. 2017 Nov 22;37(47):11505-11516. doi: 10.1523/JNEUROSCI.1007-17.2017. Epub 2017 Oct 23.
10
Neural mechanisms for selectively tuning in to the target speaker in a naturalistic noisy situation.在自然嘈杂环境中选择性地调谐到目标说话人的神经机制。
Nat Commun. 2018 Jun 19;9(1):2405. doi: 10.1038/s41467-018-04819-z.

引用本文的文献

1
Whole-brain dynamics of articulatory, acoustic and semantic speech representations.发音、声学和语义语音表征的全脑动力学。
Commun Biol. 2025 Mar 13;8(1):432. doi: 10.1038/s42003-025-07862-x.
2
Attention to audiovisual speech shapes neural processing through feedback-feedforward loops between different nodes of the speech network.听觉-视觉言语会通过言语网络不同节点之间的反馈-前馈回路来影响神经处理过程。
PLoS Biol. 2024 Mar 11;22(3):e3002534. doi: 10.1371/journal.pbio.3002534. eCollection 2024 Mar.
3
Attention-Driven Modulation of Auditory Cortex Activity during Selective Listening in a Multispeaker Setting.多说话人环境下选择性倾听时听觉皮层活动的注意驱动调制。
J Neurosci. 2024 Apr 10;44(15):e1157232023. doi: 10.1523/JNEUROSCI.1157-23.2023.
4
Cortical Tracking of Continuous Speech Under Bimodal Divided Attention.双峰式分散注意力下连续语音的皮质追踪
Neurobiol Lang (Camb). 2023 Apr 11;4(2):318-343. doi: 10.1162/nol_a_00100. eCollection 2023.
5
Semantic reconstruction of continuous language from non-invasive brain recordings.从非侵入性脑记录中重建连续语言的语义。
Nat Neurosci. 2023 May;26(5):858-866. doi: 10.1038/s41593-023-01304-9. Epub 2023 May 1.
6
Exploring Hierarchical Auditory Representation a Neural Encoding Model.探索分层听觉表征:一种神经编码模型
Front Neurosci. 2022 Mar 24;16:843988. doi: 10.3389/fnins.2022.843988. eCollection 2022.

本文引用的文献

1
Breaking down the cocktail party: Attentional modulation of cerebral audiovisual speech processing.解析鸡尾酒会效应:注意力对大脑视听言语加工的调制。
Neuroimage. 2021 Jan 1;224:117365. doi: 10.1016/j.neuroimage.2020.117365. Epub 2020 Sep 14.
2
Hierarchical Encoding of Attended Auditory Objects in Multi-talker Speech Perception.多说话人语音感知中被注意听觉对象的分层编码。
Neuron. 2019 Dec 18;104(6):1195-1209.e3. doi: 10.1016/j.neuron.2019.09.007. Epub 2019 Oct 21.
3
Cortical encoding of speech enhances task-relevant acoustic information.大脑皮层对言语的编码增强了与任务相关的声学信息。
Nat Hum Behav. 2019 Sep;3(9):974-987. doi: 10.1038/s41562-019-0648-9. Epub 2019 Jul 8.
4
Rapid Transformation from Auditory to Linguistic Representations of Continuous Speech.连续语音的听觉到语言表示的快速转换。
Curr Biol. 2018 Dec 17;28(24):3976-3983.e5. doi: 10.1016/j.cub.2018.10.042. Epub 2018 Nov 29.
5
Propagation of Information Along the Cortical Hierarchy as a Function of Attention While Reading and Listening to Stories.信息在阅读和聆听故事时沿着皮层层次结构的传播与注意力的关系。
Cereb Cortex. 2019 Sep 13;29(10):4017-4034. doi: 10.1093/cercor/bhy282.
6
Musicians at the Cocktail Party: Neural Substrates of Musical Training During Selective Listening in Multispeaker Situations.鸡尾酒会上的音乐家:多说话者情境下选择性聆听中音乐训练的神经基础。
Cereb Cortex. 2019 Jul 22;29(8):3253-3265. doi: 10.1093/cercor/bhy193.
7
Neural source dynamics of brain responses to continuous stimuli: Speech processing from acoustics to comprehension.连续刺激下大脑反应的神经源动力学:从声学处理到理解的言语加工。
Neuroimage. 2018 May 15;172:162-174. doi: 10.1016/j.neuroimage.2018.01.042. Epub 2018 Feb 3.
8
Attention Is Required for Knowledge-Based Sequential Grouping: Insights from the Integration of Syllables into Words.注意:基于知识的序列分组需要注意:从音节到单词的整合中得到的启示。
J Neurosci. 2018 Jan 31;38(5):1178-1188. doi: 10.1523/JNEUROSCI.2606-17.2017. Epub 2017 Dec 18.
9
The Right Temporoparietal Junction Supports Speech Tracking During Selective Listening: Evidence from Concurrent EEG-fMRI.右侧颞顶联合区在选择性倾听过程中支持言语追踪:来自同步脑电图-功能磁共振成像的证据。
J Neurosci. 2017 Nov 22;37(47):11505-11516. doi: 10.1523/JNEUROSCI.1007-17.2017. Epub 2017 Oct 23.
10
The Effects of Audiovisual Inputs on Solving the Cocktail Party Problem in the Human Brain: An fMRI Study.听觉和视觉输入对人脑解决鸡尾酒会问题的影响:一项 fMRI 研究。
Cereb Cortex. 2018 Oct 1;28(10):3623-3637. doi: 10.1093/cercor/bhx235.