Department of Cognitive and Neural Systems and Sargent College of Health and Rehabilitation Sciences, Boston University, Boston, Massachusetts, United States of America.
PLoS One. 2009 Dec 9;4(12):e8218. doi: 10.1371/journal.pone.0008218.
Brain-machine interfaces (BMIs) involving electrodes implanted into the human cerebral cortex have recently been developed in an attempt to restore function to profoundly paralyzed individuals. Current BMIs for restoring communication can provide important capabilities via a typing process, but unfortunately they are only capable of slow communication rates. In the current study we use a novel approach to speech restoration in which we decode continuous auditory parameters for a real-time speech synthesizer from neuronal activity in motor cortex during attempted speech.
METHODOLOGY/PRINCIPAL FINDINGS: Neural signals recorded by a Neurotrophic Electrode implanted in a speech-related region of the left precentral gyrus of a human volunteer suffering from locked-in syndrome, characterized by near-total paralysis with spared cognition, were transmitted wirelessly across the scalp and used to drive a speech synthesizer. A Kalman filter-based decoder translated the neural signals generated during attempted speech into continuous parameters for controlling a synthesizer that provided immediate (within 50 ms) auditory feedback of the decoded sound. Accuracy of the volunteer's vowel productions with the synthesizer improved quickly with practice, with a 25% improvement in average hit rate (from 45% to 70%) and 46% decrease in average endpoint error from the first to the last block of a three-vowel task.
CONCLUSIONS/SIGNIFICANCE: Our results support the feasibility of neural prostheses that may have the potential to provide near-conversational synthetic speech output for individuals with severely impaired speech motor control. They also provide an initial glimpse into the functional properties of neurons in speech motor cortical areas.
涉及植入人脑皮层的电极的脑机接口 (BMI) 最近已经开发出来,试图恢复严重瘫痪患者的功能。目前用于恢复交流的 BMI 可以通过打字过程提供重要功能,但不幸的是,它们只能实现缓慢的通信速率。在当前的研究中,我们使用了一种新的方法来恢复语音,我们从运动皮层在尝试说话期间的神经元活动中解码实时语音合成器的连续听觉参数。
方法/主要发现: 一名患有闭锁综合征的人类志愿者(其特征是几乎完全瘫痪但认知能力未受损)的左中央前回的与言语相关区域植入了神经滋养电极,记录了神经信号,这些信号通过无线方式穿过头皮并用于驱动语音合成器。基于卡尔曼滤波器的解码器将尝试说话期间产生的神经信号转换为控制合成器的连续参数,该合成器提供解码声音的即时(在 50 毫秒内)听觉反馈。志愿者使用合成器进行元音发音的准确性随着练习的进行迅速提高,平均命中率提高了 25%(从 45%提高到 70%),在三个元音任务的第一个和最后一个块之间,平均端点误差降低了 46%。
结论/意义: 我们的结果支持神经假体的可行性,神经假体有可能为严重言语运动控制受损的个体提供近乎对话式的合成语音输出。它们还初步揭示了言语运动皮层区域神经元的功能特性。