• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

视觉输入增强了“鸡尾酒会”中听觉皮层对选择性语音包络的跟踪。

Visual input enhances selective speech envelope tracking in auditory cortex at a "cocktail party".

机构信息

Department of Psychiatry, Columbia University College of Physicians and Surgeons, New York, New York, 10032, USA.

出版信息

J Neurosci. 2013 Jan 23;33(4):1417-26. doi: 10.1523/JNEUROSCI.3675-12.2013.

DOI:10.1523/JNEUROSCI.3675-12.2013
PMID:23345218
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3711546/
Abstract

Our ability to selectively attend to one auditory signal amid competing input streams, epitomized by the "Cocktail Party" problem, continues to stimulate research from various approaches. How this demanding perceptual feat is achieved from a neural systems perspective remains unclear and controversial. It is well established that neural responses to attended stimuli are enhanced compared with responses to ignored ones, but responses to ignored stimuli are nonetheless highly significant, leading to interference in performance. We investigated whether congruent visual input of an attended speaker enhances cortical selectivity in auditory cortex, leading to diminished representation of ignored stimuli. We recorded magnetoencephalographic signals from human participants as they attended to segments of natural continuous speech. Using two complementary methods of quantifying the neural response to speech, we found that viewing a speaker's face enhances the capacity of auditory cortex to track the temporal speech envelope of that speaker. This mechanism was most effective in a Cocktail Party setting, promoting preferential tracking of the attended speaker, whereas without visual input no significant attentional modulation was observed. These neurophysiological results underscore the importance of visual input in resolving perceptual ambiguity in a noisy environment. Since visual cues in speech precede the associated auditory signals, they likely serve a predictive role in facilitating auditory processing of speech, perhaps by directing attentional resources to appropriate points in time when to-be-attended acoustic input is expected to arrive.

摘要

我们能够从竞争的输入流中选择性地关注一个听觉信号,这一能力以“鸡尾酒会”问题为代表,继续激发着来自不同方法的研究。从神经系统的角度来看,这种高要求的感知能力是如何实现的,目前仍不清楚且存在争议。已经证实,与忽略的刺激相比,对关注的刺激的神经反应得到了增强,但对忽略的刺激的反应仍然非常显著,导致了性能的干扰。我们研究了被关注的说话者的一致的视觉输入是否会增强听觉皮层的皮层选择性,从而减少对忽略的刺激的表示。我们记录了人类参与者在聆听自然连续语音时的脑磁图信号。使用两种定量言语神经反应的互补方法,我们发现,观看说话者的脸会增强听觉皮层跟踪该说话者的时间言语包络的能力。这种机制在鸡尾酒会环境中最为有效,促进了对关注的说话者的优先跟踪,而没有视觉输入时则观察不到显著的注意力调节。这些神经生理学结果强调了视觉输入在嘈杂环境中解决感知歧义的重要性。由于言语中的视觉线索先于相关的听觉信号,它们可能在促进言语的听觉处理中起到预测作用,也许是通过将注意力资源引导到预期听觉输入到达的适当时间点。

相似文献

1
Visual input enhances selective speech envelope tracking in auditory cortex at a "cocktail party".视觉输入增强了“鸡尾酒会”中听觉皮层对选择性语音包络的跟踪。
J Neurosci. 2013 Jan 23;33(4):1417-26. doi: 10.1523/JNEUROSCI.3675-12.2013.
2
Left Superior Temporal Gyrus Is Coupled to Attended Speech in a Cocktail-Party Auditory Scene.左颞上回在鸡尾酒会听觉场景中与被关注的语音相关联。
J Neurosci. 2016 Feb 3;36(5):1596-606. doi: 10.1523/JNEUROSCI.1730-15.2016.
3
Cortical Tracking of Speech-in-Noise Develops from Childhood to Adulthood.语音噪声中皮层追踪由儿童期发展至成年期。
J Neurosci. 2019 Apr 10;39(15):2938-2950. doi: 10.1523/JNEUROSCI.1732-18.2019. Epub 2019 Feb 11.
4
Cortical Representations of Speech in a Multitalker Auditory Scene.多说话者听觉场景中语音的皮质表征
J Neurosci. 2017 Sep 20;37(38):9189-9196. doi: 10.1523/JNEUROSCI.0938-17.2017. Epub 2017 Aug 18.
5
Musicians at the Cocktail Party: Neural Substrates of Musical Training During Selective Listening in Multispeaker Situations.鸡尾酒会上的音乐家:多说话者情境下选择性聆听中音乐训练的神经基础。
Cereb Cortex. 2019 Jul 22;29(8):3253-3265. doi: 10.1093/cercor/bhy193.
6
The effects of selective attention and speech acoustics on neural speech-tracking in a multi-talker scene.在多说话者场景中选择性注意和语音声学对神经语音追踪的影响。
Cortex. 2015 Jul;68:144-54. doi: 10.1016/j.cortex.2014.12.014. Epub 2015 Jan 7.
7
Attentional Modulation of the Cortical Contribution to the Frequency-Following Response Evoked by Continuous Speech.注意对连续语音诱发的频率跟随反应的皮层贡献的调制。
J Neurosci. 2023 Nov 1;43(44):7429-7440. doi: 10.1523/JNEUROSCI.1247-23.2023. Epub 2023 Oct 4.
8
Congruent audiovisual speech enhances auditory attention decoding with EEG.视听语音一致增强了 EEG 对听觉注意力的解码。
J Neural Eng. 2019 Nov 6;16(6):066033. doi: 10.1088/1741-2552/ab4340.
9
The Right Temporoparietal Junction Supports Speech Tracking During Selective Listening: Evidence from Concurrent EEG-fMRI.右侧颞顶联合区在选择性倾听过程中支持言语追踪:来自同步脑电图-功能磁共振成像的证据。
J Neurosci. 2017 Nov 22;37(47):11505-11516. doi: 10.1523/JNEUROSCI.1007-17.2017. Epub 2017 Oct 23.
10
Congruent Visual Speech Enhances Cortical Entrainment to Continuous Auditory Speech in Noise-Free Conditions.在无噪声条件下,匹配的视觉语音增强了皮质对连续听觉语音的同步化。
J Neurosci. 2015 Oct 21;35(42):14195-204. doi: 10.1523/JNEUROSCI.1829-15.2015.

引用本文的文献

1
Children's cortical speech tracking in face-to-face and online video communication.儿童在面对面和在线视频交流中的皮层语音追踪
Sci Rep. 2025 Jun 20;15(1):20134. doi: 10.1038/s41598-025-04778-8.
2
Seeing a Talker's Mouth Reduces the Effort of Perceiving Speech and Repairing Perceptual Mistakes for Listeners With Cochlear Implants.看到说话者的嘴部动作可减轻人工耳蜗佩戴者感知语音和纠正感知错误的难度。
Ear Hear. 2025 Jun 16. doi: 10.1097/AUD.0000000000001683.
3
How strong is the rhythm of perception? A registered replication of Hickok . (2015).感知节奏有多强?希科克(2015年)的一项注册复制研究。
R Soc Open Sci. 2025 Jun 11;12(6):220497. doi: 10.1098/rsos.220497. eCollection 2025 Jun.
4
Neural Speech Tracking during Selective Attention: A Spatially Realistic Audiovisual Study.选择性注意期间的神经语音追踪:一项空间逼真的视听研究。
eNeuro. 2025 Jun 24;12(6). doi: 10.1523/ENEURO.0132-24.2025. Print 2025 Jun.
5
Synchrony perception of audiovisual speech is a reliable, yet individual construct.视听语音的同步感知是一种可靠但因人而异的结构。
Sci Rep. 2025 May 7;15(1):15909. doi: 10.1038/s41598-025-00243-8.
6
Objectively Measuring Audiovisual Effects in Noise Using Virtual Human Speakers.使用虚拟人类说话者客观测量噪声中的视听效果。
Trends Hear. 2025 Jan-Dec;29:23312165251333528. doi: 10.1177/23312165251333528. Epub 2025 Apr 13.
7
Neural Speech Tracking Contribution of Lip Movements Predicts Behavioral Deterioration When the Speaker's Mouth Is Occluded.唇部运动对神经语音追踪的贡献可预测说话者嘴巴被遮挡时的行为恶化。
eNeuro. 2025 Feb 5;12(2). doi: 10.1523/ENEURO.0368-24.2024. Print 2025 Feb.
8
Laminar organization of visual responses in core and parabelt auditory cortex.视皮层核区和旁带状区听觉反应的层状组织。
Cereb Cortex. 2024 Sep 3;34(9). doi: 10.1093/cercor/bhae373.
9
The impact of face masks on face-to-face neural tracking of speech: Auditory and visual obstacles.口罩对言语面对面神经追踪的影响:听觉和视觉障碍。
Heliyon. 2024 Jul 19;10(15):e34860. doi: 10.1016/j.heliyon.2024.e34860. eCollection 2024 Aug 15.
10
Attention to audiovisual speech shapes neural processing through feedback-feedforward loops between different nodes of the speech network.听觉-视觉言语会通过言语网络不同节点之间的反馈-前馈回路来影响神经处理过程。
PLoS Biol. 2024 Mar 11;22(3):e3002534. doi: 10.1371/journal.pbio.3002534. eCollection 2024 Mar.

本文引用的文献

1
Phase-locked responses to speech in human auditory cortex are enhanced during comprehension.人类听觉皮层对言语的锁相反应在理解过程中增强。
Cereb Cortex. 2013 Jun;23(6):1378-87. doi: 10.1093/cercor/bhs118. Epub 2012 May 17.
2
Selective cortical representation of attended speaker in multi-talker speech perception.选择性皮层对多说话人语音感知中被注意说话人的代表。
Nature. 2012 May 10;485(7397):233-6. doi: 10.1038/nature11020.
3
Cortical oscillations and speech processing: emerging computational principles and operations.皮质振荡与言语加工:新兴的计算原理与操作
Nat Neurosci. 2012 Mar 18;15(4):511-7. doi: 10.1038/nn.3063.
4
Temporal context in speech processing and attentional stream selection: a behavioral and neural perspective.言语加工中的时间语境和注意流选择:行为和神经学视角。
Brain Lang. 2012 Sep;122(3):151-61. doi: 10.1016/j.bandl.2011.12.010. Epub 2012 Jan 29.
5
Speech comprehension aided by multiple modalities: behavioural and neural interactions.多模态辅助言语理解:行为和神经的相互作用。
Neuropsychologia. 2012 Apr;50(5):762-76. doi: 10.1016/j.neuropsychologia.2012.01.010. Epub 2012 Jan 17.
6
Magnetic brain activity phase-locked to the envelope, the syllable onsets, and the fundamental frequency of a perceived speech signal.感知语音信号的包络、音节起始和基频锁相的大脑磁活动。
Psychophysiology. 2012 Mar;49(3):322-34. doi: 10.1111/j.1469-8986.2011.01314.x. Epub 2011 Dec 16.
7
Neural coding of continuous speech in auditory cortex during monaural and dichotic listening.听觉皮层在单耳和双耳聆听时对连续语音的神经编码。
J Neurophysiol. 2012 Jan;107(1):78-89. doi: 10.1152/jn.00297.2011. Epub 2011 Oct 5.
8
Linking speech perception and neurophysiology: speech decoding guided by cascaded oscillators locked to the input rhythm.连接言语感知与神经生理学:由锁定输入节奏的级联振荡器引导的言语解码。
Front Psychol. 2011 Jun 27;2:130. doi: 10.3389/fpsyg.2011.00130. eCollection 2011.
9
Transitions in neural oscillations reflect prediction errors generated in audiovisual speech.神经振荡的转变反映了视听语音中产生的预测误差。
Nat Neurosci. 2011 Jun;14(6):797-801. doi: 10.1038/nn.2810. Epub 2011 May 8.
10
FieldTrip: Open source software for advanced analysis of MEG, EEG, and invasive electrophysiological data.FieldTrip:用于 MEG、EEG 和有创电生理数据的高级分析的开源软件。
Comput Intell Neurosci. 2011;2011:156869. doi: 10.1155/2011/156869. Epub 2010 Dec 23.