Department of Neurological Surgery and the Center for Integrative Neuroscience at UC San Francisco, 675 Nelson Rising Lane, San Francisco, CA, 94158, USA.
Nat Commun. 2019 Jul 30;10(1):3096. doi: 10.1038/s41467-019-10994-4.
Natural communication often occurs in dialogue, differentially engaging auditory and sensorimotor brain regions during listening and speaking. However, previous attempts to decode speech directly from the human brain typically consider listening or speaking tasks in isolation. Here, human participants listened to questions and responded aloud with answers while we used high-density electrocorticography (ECoG) recordings to detect when they heard or said an utterance and to then decode the utterance's identity. Because certain answers were only plausible responses to certain questions, we could dynamically update the prior probabilities of each answer using the decoded question likelihoods as context. We decode produced and perceived utterances with accuracy rates as high as 61% and 76%, respectively (chance is 7% and 20%). Contextual integration of decoded question likelihoods significantly improves answer decoding. These results demonstrate real-time decoding of speech in an interactive, conversational setting, which has important implications for patients who are unable to communicate.
自然交流通常发生在对话中,在听和说的过程中,听觉和运动感觉大脑区域会有不同程度的参与。然而,以前尝试直接从人类大脑中解码言语的方法通常将听或说的任务孤立起来考虑。在这里,人类参与者听问题并大声回答,而我们使用高密度脑电图 (ECoG) 记录来检测他们何时听到或说出一个语句,然后解码该语句的身份。因为某些答案只是对某些问题的合理回答,所以我们可以使用解码的问题可能性作为上下文动态更新每个答案的先验概率。我们分别以高达 61%和 76%的准确率解码产生和感知的语句(机会为 7%和 20%)。解码的问题可能性的上下文集成显著提高了答案的解码。这些结果证明了在交互式对话环境中实时解码言语的能力,这对无法交流的患者具有重要意义。