Department of Neurological Surgery, UC San Francisco, CA, United States of America. Center for Integrative Neuroscience, UC San Francisco, CA, United States of America. Graduate Program in Bioengineering, UC Berkeley-UC San Francisco, CA, United States of America.
J Neural Eng. 2018 Jun;15(3):036005. doi: 10.1088/1741-2552/aaab6f. Epub 2018 Jan 30.
Recent research has characterized the anatomical and functional basis of speech perception in the human auditory cortex. These advances have made it possible to decode speech information from activity in brain regions like the superior temporal gyrus, but no published work has demonstrated this ability in real-time, which is necessary for neuroprosthetic brain-computer interfaces.
Here, we introduce a real-time neural speech recognition (rtNSR) software package, which was used to classify spoken input from high-resolution electrocorticography signals in real-time. We tested the system with two human subjects implanted with electrode arrays over the lateral brain surface. Subjects listened to multiple repetitions of ten sentences, and rtNSR classified what was heard in real-time from neural activity patterns using direct sentence-level and HMM-based phoneme-level classification schemes.
We observed single-trial sentence classification accuracies of [Formula: see text] or higher for each subject with less than 7 minutes of training data, demonstrating the ability of rtNSR to use cortical recordings to perform accurate real-time speech decoding in a limited vocabulary setting.
Further development and testing of the package with different speech paradigms could influence the design of future speech neuroprosthetic applications.
最近的研究已经描述了人类听觉皮层中语音感知的解剖学和功能基础。这些进展使得从颞上回等脑区的活动中解码语音信息成为可能,但尚无发表的工作证明这种能力可以实时实现,这对于神经假体脑机接口是必要的。
在这里,我们引入了一个实时神经语音识别 (rtNSR) 软件包,该软件包用于实时对高分辨率皮层电图信号中的语音输入进行分类。我们使用两个植入了侧脑表面电极阵列的人类受试者来测试该系统。受试者听了十句话的多次重复,并使用直接基于句子级和基于 HMM 的音素级分类方案的实时神经活动模式对听到的内容进行 rtNSR 分类。
我们观察到,每个受试者在不到 7 分钟的训练数据下,单次试验的句子分类准确率达到[公式:见正文]或更高,这表明 rtNSR 能够使用皮层记录在有限词汇设置中进行准确的实时语音解码。
该软件包与不同语音范式的进一步开发和测试可能会影响未来语音神经假体应用的设计。