O'Sullivan James A, Power Alan J, Mesgarani Nima, Rajaram Siddharth, Foxe John J, Shinn-Cunningham Barbara G, Slaney Malcolm, Shamma Shihab A, Lalor Edmund C
School of Engineering, Trinity Centre for Bioengineering and Trinity College Institute of Neuroscience, Trinity College Dublin, Dublin 2, Ireland.
School of Engineering, Trinity Centre for Bioengineering and Trinity College Institute of Neuroscience, Trinity College Dublin, Dublin 2, Ireland Department of Psychology, Centre for Neuroscience in Education, University of Cambridge, Cambridge, UK.
Cereb Cortex. 2015 Jul;25(7):1697-706. doi: 10.1093/cercor/bht355. Epub 2014 Jan 15.
How humans solve the cocktail party problem remains unknown. However, progress has been made recently thanks to the realization that cortical activity tracks the amplitude envelope of speech. This has led to the development of regression methods for studying the neurophysiology of continuous speech. One such method, known as stimulus-reconstruction, has been successfully utilized with cortical surface recordings and magnetoencephalography (MEG). However, the former is invasive and gives a relatively restricted view of processing along the auditory hierarchy, whereas the latter is expensive and rare. Thus it would be extremely useful for research in many populations if stimulus-reconstruction was effective using electroencephalography (EEG), a widely available and inexpensive technology. Here we show that single-trial (≈60 s) unaveraged EEG data can be decoded to determine attentional selection in a naturalistic multispeaker environment. Furthermore, we show a significant correlation between our EEG-based measure of attention and performance on a high-level attention task. In addition, by attempting to decode attention at individual latencies, we identify neural processing at ∼200 ms as being critical for solving the cocktail party problem. These findings open up new avenues for studying the ongoing dynamics of cognition using EEG and for developing effective and natural brain-computer interfaces.
人类如何解决鸡尾酒会问题仍然未知。然而,由于认识到皮层活动跟踪语音的幅度包络,最近取得了进展。这导致了用于研究连续语音神经生理学的回归方法的发展。一种这样的方法,称为刺激重建,已成功应用于皮层表面记录和脑磁图(MEG)。然而,前者具有侵入性,并且对沿听觉层次的处理提供了相对受限的视角,而后者昂贵且稀少。因此,如果使用脑电图(EEG)(一种广泛可用且廉价的技术)刺激重建有效,那么对于许多人群的研究将极其有用。在这里,我们表明单次试验(约60秒)未平均的EEG数据可以被解码,以确定自然主义多说话者环境中的注意力选择。此外,我们展示了基于EEG的注意力测量与高级注意力任务表现之间的显著相关性。此外,通过尝试在各个潜伏期解码注意力,我们确定约200毫秒时的神经处理对于解决鸡尾酒会问题至关重要。这些发现为使用EEG研究认知的持续动态以及开发有效和自然的脑机接口开辟了新途径。