Department of Neurological Surgery, University of California, San Francisco, San Francisco, CA, USA.
Weill Institute for Neuroscience, University of California, San Francisco, San Francisco, CA, USA.
Nat Rev Neurosci. 2024 Jul;25(7):473-492. doi: 10.1038/s41583-024-00819-9. Epub 2024 May 14.
Loss of speech after paralysis is devastating, but circumventing motor-pathway injury by directly decoding speech from intact cortical activity has the potential to restore natural communication and self-expression. Recent discoveries have defined how key features of speech production are facilitated by the coordinated activity of vocal-tract articulatory and motor-planning cortical representations. In this Review, we highlight such progress and how it has led to successful speech decoding, first in individuals implanted with intracranial electrodes for clinical epilepsy monitoring and subsequently in individuals with paralysis as part of early feasibility clinical trials to restore speech. We discuss high-spatiotemporal-resolution neural interfaces and the adaptation of state-of-the-art speech computational algorithms that have driven rapid and substantial progress in decoding neural activity into text, audible speech, and facial movements. Although restoring natural speech is a long-term goal, speech neuroprostheses already have performance levels that surpass communication rates offered by current assistive-communication technology. Given this accelerated rate of progress in the field, we propose key evaluation metrics for speed and accuracy, among others, to help standardize across studies. We finish by highlighting several directions to more fully explore the multidimensional feature space of speech and language, which will continue to accelerate progress towards a clinically viable speech neuroprosthesis.
瘫痪后丧失语言能力是毁灭性的,但通过直接从完整的皮质活动解码言语来绕过运动通路损伤,有可能恢复自然的交流和自我表达。最近的发现定义了言语产生的关键特征是如何通过声道发音和运动规划皮质代表的协调活动来促进的。在这篇综述中,我们强调了这些进展,以及它们如何导致成功的言语解码,首先是在接受颅内电极进行临床癫痫监测的个体中,随后是在作为早期可行性临床试验的一部分以恢复言语的瘫痪个体中。我们讨论了高时空分辨率的神经接口和最先进的言语计算算法的适应性,这些算法推动了将神经活动解码为文本、可听言语和面部运动的快速和实质性进展。尽管恢复自然言语是一个长期目标,但言语神经假体已经达到了超越当前辅助交流技术提供的交流速度的性能水平。鉴于该领域的快速进展,我们提出了速度和准确性等关键评估指标,以帮助在研究之间进行标准化。最后,我们强调了探索言语和语言多维特征空间的几个方向,这将继续加速实现临床可行的言语神经假体。