基于功能近红外光谱技术（fNIRS）脑信号的语音识别

Speech Recognition via fNIRS Based Brain Signals.

作者信息

Liu Yichuan, Ayaz Hasan

机构信息

School of Biomedical Engineering, Drexel University, Science and Health Systems, Philadelphia, PA, United States.

Cognitive Neuroengineering and Quantitative Experimental Research (CONQUER) Collaborative, Drexel University, Philadelphia, PA, United States.

出版信息

Front Neurosci. 2018 Oct 9;12:695. doi: 10.3389/fnins.2018.00695. eCollection 2018.

DOI:10.3389/fnins.2018.00695

PMID:30356771

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6189799/

Abstract

In this paper, we present the first evidence that perceived speech can be identified from the listeners' brain signals measured via functional-near infrared spectroscopy (fNIRS)-a non-invasive, portable, and wearable neuroimaging technique suitable for ecologically valid settings. In this study, participants listened audio clips containing English stories while prefrontal and parietal cortices were monitored with fNIRS. Machine learning was applied to train predictive models using fNIRS data from a subject pool to predict which part of a story was listened by a new subject not in the pool based on the brain's hemodynamic response as measured by fNIRS. fNIRS signals can vary considerably from subject to subject due to the different head size, head shape, and spatial locations of brain functional regions. To overcome this difficulty, a generalized canonical correlation analysis (GCCA) was adopted to extract latent variables that are shared among the listeners before applying principal component analysis (PCA) for dimension reduction and applying logistic regression for classification. A 74.7% average accuracy has been achieved for differentiating between two 50 s. long story segments and a 43.6% average accuracy has been achieved for differentiating four 25 s. long story segments. These results suggest the potential of an fNIRS based-approach for building a speech decoding brain-computer-interface for developing a new type of neural prosthetic system.

摘要

在本文中，我们首次证明，通过功能近红外光谱技术（fNIRS）——一种适用于生态有效环境的非侵入性、便携式且可穿戴的神经成像技术，能够从听众的大脑信号中识别出感知到的语音。在这项研究中，参与者聆听包含英语故事的音频片段，同时用fNIRS监测前额叶和顶叶皮质。运用机器学习，利用来自一个受试者池的fNIRS数据训练预测模型，以便根据fNIRS测量的大脑血液动力学反应，预测不在该受试者池中的新受试者听了故事的哪一部分。由于头部大小、头部形状以及大脑功能区域的空间位置不同，fNIRS信号在不同受试者之间可能会有很大差异。为克服这一困难，在应用主成分分析（PCA）进行降维和应用逻辑回归进行分类之前，采用广义典型相关分析（GCCA）来提取听众之间共享的潜在变量。区分两个50秒长的故事片段时平均准确率达到了74.7%，区分四个25秒长的故事片段时平均准确率达到了43.6%。这些结果表明了基于fNIRS的方法在构建用于开发新型神经假体系统的语音解码脑机接口方面的潜力。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

基于功能近红外光谱技术（fNIRS）脑信号的语音识别

Speech Recognition via fNIRS Based Brain Signals.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

基于功能近红外光谱技术（fNIRS）脑信号的语音识别

Speech Recognition via fNIRS Based Brain Signals.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献