Liu Yichuan, Ayaz Hasan
School of Biomedical Engineering, Drexel University, Science and Health Systems, Philadelphia, PA, United States.
Cognitive Neuroengineering and Quantitative Experimental Research (CONQUER) Collaborative, Drexel University, Philadelphia, PA, United States.
Front Neurosci. 2018 Oct 9;12:695. doi: 10.3389/fnins.2018.00695. eCollection 2018.
In this paper, we present the first evidence that perceived speech can be identified from the listeners' brain signals measured via functional-near infrared spectroscopy (fNIRS)-a non-invasive, portable, and wearable neuroimaging technique suitable for ecologically valid settings. In this study, participants listened audio clips containing English stories while prefrontal and parietal cortices were monitored with fNIRS. Machine learning was applied to train predictive models using fNIRS data from a subject pool to predict which part of a story was listened by a new subject not in the pool based on the brain's hemodynamic response as measured by fNIRS. fNIRS signals can vary considerably from subject to subject due to the different head size, head shape, and spatial locations of brain functional regions. To overcome this difficulty, a generalized canonical correlation analysis (GCCA) was adopted to extract latent variables that are shared among the listeners before applying principal component analysis (PCA) for dimension reduction and applying logistic regression for classification. A 74.7% average accuracy has been achieved for differentiating between two 50 s. long story segments and a 43.6% average accuracy has been achieved for differentiating four 25 s. long story segments. These results suggest the potential of an fNIRS based-approach for building a speech decoding brain-computer-interface for developing a new type of neural prosthetic system.
在本文中,我们首次证明,通过功能近红外光谱技术(fNIRS)——一种适用于生态有效环境的非侵入性、便携式且可穿戴的神经成像技术,能够从听众的大脑信号中识别出感知到的语音。在这项研究中,参与者聆听包含英语故事的音频片段,同时用fNIRS监测前额叶和顶叶皮质。运用机器学习,利用来自一个受试者池的fNIRS数据训练预测模型,以便根据fNIRS测量的大脑血液动力学反应,预测不在该受试者池中的新受试者听了故事的哪一部分。由于头部大小、头部形状以及大脑功能区域的空间位置不同,fNIRS信号在不同受试者之间可能会有很大差异。为克服这一困难,在应用主成分分析(PCA)进行降维和应用逻辑回归进行分类之前,采用广义典型相关分析(GCCA)来提取听众之间共享的潜在变量。区分两个50秒长的故事片段时平均准确率达到了74.7%,区分四个25秒长的故事片段时平均准确率达到了43.6%。这些结果表明了基于fNIRS的方法在构建用于开发新型神经假体系统的语音解码脑机接口方面的潜力。