Howard Hughes Medical Institute at Stanford University, Stanford, CA, USA.
Department of Electrical Engineering, Stanford University, Stanford, CA, USA.
Nature. 2023 Aug;620(7976):1031-1036. doi: 10.1038/s41586-023-06377-x. Epub 2023 Aug 23.
Speech brain-computer interfaces (BCIs) have the potential to restore rapid communication to people with paralysis by decoding neural activity evoked by attempted speech into text or sound. Early demonstrations, although promising, have not yet achieved accuracies sufficiently high for communication of unconstrained sentences from a large vocabulary. Here we demonstrate a speech-to-text BCI that records spiking activity from intracortical microelectrode arrays. Enabled by these high-resolution recordings, our study participant-who can no longer speak intelligibly owing to amyotrophic lateral sclerosis-achieved a 9.1% word error rate on a 50-word vocabulary (2.7 times fewer errors than the previous state-of-the-art speech BCI) and a 23.8% word error rate on a 125,000-word vocabulary (the first successful demonstration, to our knowledge, of large-vocabulary decoding). Our participant's attempted speech was decoded at 62 words per minute, which is 3.4 times as fast as the previous record and begins to approach the speed of natural conversation (160 words per minute). Finally, we highlight two aspects of the neural code for speech that are encouraging for speech BCIs: spatially intermixed tuning to speech articulators that makes accurate decoding possible from only a small region of cortex, and a detailed articulatory representation of phonemes that persists years after paralysis. These results show a feasible path forward for restoring rapid communication to people with paralysis who can no longer speak.
言语脑-机接口(BCI)有可能通过解码意图言语所引发的神经活动,将文本或声音转换为文字,从而恢复瘫痪患者的快速交流。早期的演示虽然很有前景,但尚未达到足够高的准确性,无法实现从大容量词汇中进行不受限制的句子交流。在这里,我们展示了一种基于皮层内微电极阵列记录的言语到文本 BCI。通过这些高分辨率记录,我们的研究参与者——由于肌萎缩性侧索硬化症而无法清晰说话——在 50 个单词词汇上的单词错误率达到 9.1%(比之前的最先进的言语 BCI 少 2.7 倍错误),在 125000 个单词词汇上的单词错误率达到 23.8%(据我们所知,这是首次成功展示大容量词汇解码)。我们的参与者尝试说话的速度达到每分钟 62 个单词,是之前记录的 3.4 倍,开始接近自然对话的速度(每分钟 160 个单词)。最后,我们强调了言语神经编码的两个方面,这对言语 BCI 来说是令人鼓舞的:对言语发音器官的空间混合调谐使得仅从大脑的一小部分区域就可以实现准确解码,以及对语音音位的详细发音表示,即使在瘫痪多年后仍然存在。这些结果为恢复无法说话的瘫痪患者的快速交流展示了一条可行的道路。