• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于功能近红外光谱技术(fNIRS)脑信号的语音识别

Speech Recognition via fNIRS Based Brain Signals.

作者信息

Liu Yichuan, Ayaz Hasan

机构信息

School of Biomedical Engineering, Drexel University, Science and Health Systems, Philadelphia, PA, United States.

Cognitive Neuroengineering and Quantitative Experimental Research (CONQUER) Collaborative, Drexel University, Philadelphia, PA, United States.

出版信息

Front Neurosci. 2018 Oct 9;12:695. doi: 10.3389/fnins.2018.00695. eCollection 2018.

DOI:10.3389/fnins.2018.00695
PMID:30356771
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6189799/
Abstract

In this paper, we present the first evidence that perceived speech can be identified from the listeners' brain signals measured via functional-near infrared spectroscopy (fNIRS)-a non-invasive, portable, and wearable neuroimaging technique suitable for ecologically valid settings. In this study, participants listened audio clips containing English stories while prefrontal and parietal cortices were monitored with fNIRS. Machine learning was applied to train predictive models using fNIRS data from a subject pool to predict which part of a story was listened by a new subject not in the pool based on the brain's hemodynamic response as measured by fNIRS. fNIRS signals can vary considerably from subject to subject due to the different head size, head shape, and spatial locations of brain functional regions. To overcome this difficulty, a generalized canonical correlation analysis (GCCA) was adopted to extract latent variables that are shared among the listeners before applying principal component analysis (PCA) for dimension reduction and applying logistic regression for classification. A 74.7% average accuracy has been achieved for differentiating between two 50 s. long story segments and a 43.6% average accuracy has been achieved for differentiating four 25 s. long story segments. These results suggest the potential of an fNIRS based-approach for building a speech decoding brain-computer-interface for developing a new type of neural prosthetic system.

摘要

在本文中,我们首次证明,通过功能近红外光谱技术(fNIRS)——一种适用于生态有效环境的非侵入性、便携式且可穿戴的神经成像技术,能够从听众的大脑信号中识别出感知到的语音。在这项研究中,参与者聆听包含英语故事的音频片段,同时用fNIRS监测前额叶和顶叶皮质。运用机器学习,利用来自一个受试者池的fNIRS数据训练预测模型,以便根据fNIRS测量的大脑血液动力学反应,预测不在该受试者池中的新受试者听了故事的哪一部分。由于头部大小、头部形状以及大脑功能区域的空间位置不同,fNIRS信号在不同受试者之间可能会有很大差异。为克服这一困难,在应用主成分分析(PCA)进行降维和应用逻辑回归进行分类之前,采用广义典型相关分析(GCCA)来提取听众之间共享的潜在变量。区分两个50秒长的故事片段时平均准确率达到了74.7%,区分四个25秒长的故事片段时平均准确率达到了43.6%。这些结果表明了基于fNIRS的方法在构建用于开发新型神经假体系统的语音解码脑机接口方面的潜力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61ec/6189799/8bcabe8e7863/fnins-12-00695-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61ec/6189799/82888229ba1c/fnins-12-00695-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61ec/6189799/88c2a6f5163c/fnins-12-00695-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61ec/6189799/122674521ff6/fnins-12-00695-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61ec/6189799/8a63f46bd6a4/fnins-12-00695-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61ec/6189799/f3dd89ccae2d/fnins-12-00695-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61ec/6189799/3e33f7c1b29d/fnins-12-00695-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61ec/6189799/8bcabe8e7863/fnins-12-00695-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61ec/6189799/82888229ba1c/fnins-12-00695-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61ec/6189799/88c2a6f5163c/fnins-12-00695-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61ec/6189799/122674521ff6/fnins-12-00695-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61ec/6189799/8a63f46bd6a4/fnins-12-00695-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61ec/6189799/f3dd89ccae2d/fnins-12-00695-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61ec/6189799/3e33f7c1b29d/fnins-12-00695-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/61ec/6189799/8bcabe8e7863/fnins-12-00695-g0007.jpg

相似文献

1
Speech Recognition via fNIRS Based Brain Signals.基于功能近红外光谱技术(fNIRS)脑信号的语音识别
Front Neurosci. 2018 Oct 9;12:695. doi: 10.3389/fnins.2018.00695. eCollection 2018.
2
Measuring speaker-listener neural coupling with functional near infrared spectroscopy.运用功能近红外光谱技术测量说话者-倾听者的神经耦合。
Sci Rep. 2017 Feb 27;7:43293. doi: 10.1038/srep43293.
3
K-Means Clustering Machine Learning Approach Reveals Groups of Homogeneous Individuals With Unique Brain Activation, Task, and Performance Dynamics Using fNIRS.K-均值聚类机器学习方法利用 fNIRS 揭示了具有独特大脑激活、任务和表现动态的同质个体群组。
IEEE Trans Neural Syst Rehabil Eng. 2023;31:2535-2544. doi: 10.1109/TNSRE.2023.3278268. Epub 2023 Jun 6.
4
Imagined speech increases the hemodynamic response and functional connectivity of the dorsal motor cortex.想象中的言语会增加大脑背侧运动皮质的血流动力学反应和功能连接。
J Neural Eng. 2021 Oct 7;18(5). doi: 10.1088/1741-2552/ac25d9.
5
Decoding four different sound-categories in the auditory cortex using functional near-infrared spectroscopy.使用功能近红外光谱技术在听觉皮层中解码四种不同的声音类别。
Hear Res. 2016 Mar;333:157-166. doi: 10.1016/j.heares.2016.01.009. Epub 2016 Jan 29.
6
Subject-Independent Functional Near-Infrared Spectroscopy-Based Brain-Computer Interfaces Based on Convolutional Neural Networks.基于卷积神经网络的独立于受试者的基于功能近红外光谱的脑机接口
Front Hum Neurosci. 2021 Mar 12;15:646915. doi: 10.3389/fnhum.2021.646915. eCollection 2021.
7
Hybrid EEG-fNIRS-Based Eight-Command Decoding for BCI: Application to Quadcopter Control.基于脑电图和功能近红外光谱的脑机接口八指令解码:在四轴飞行器控制中的应用
Front Neurorobot. 2017 Feb 17;11:6. doi: 10.3389/fnbot.2017.00006. eCollection 2017.
8
Exploratory insights into prefrontal cortex activity in continuous glucose monitoring: findings from a portable wearable functional near-infrared spectroscopy system.连续血糖监测中前额叶皮层活动的探索性见解:来自便携式可穿戴功能近红外光谱系统的发现
Front Neurosci. 2024 May 8;18:1342744. doi: 10.3389/fnins.2024.1342744. eCollection 2024.
9
Decoding Articulation Motor Imagery Using Early Connectivity Information in the Motor Cortex: A Functional Near-Infrared Spectroscopy Study.利用运动皮层早期连接信息解码关节运动想象:一项功能近红外光谱研究
IEEE Trans Neural Syst Rehabil Eng. 2023;31:506-518. doi: 10.1109/TNSRE.2022.3227595. Epub 2023 Feb 1.
10
A hybrid BCI based on EEG and fNIRS signals improves the performance of decoding motor imagery of both force and speed of hand clenching.一种基于脑电图(EEG)和功能近红外光谱(fNIRS)信号的混合脑机接口(BCI)提高了对手部紧握力和速度的运动想象解码性能。
J Neural Eng. 2015 Jun;12(3):036004. doi: 10.1088/1741-2560/12/3/036004. Epub 2015 Apr 2.

引用本文的文献

1
Neural signals, machine learning, and the future of inner speech recognition.神经信号、机器学习与内心言语识别的未来
Front Hum Neurosci. 2025 Jul 10;19:1637174. doi: 10.3389/fnhum.2025.1637174. eCollection 2025.
2
Classification of Imagined Speech Signals Using Functional Connectivity Graphs and Machine Learning Models.使用功能连接图和机器学习模型对想象语音信号进行分类
Brain Topogr. 2025 Jan 28;38(2):25. doi: 10.1007/s10548-025-01100-7.
3
Boosting brain-computer interfaces with functional electrical stimulation: potential applications in people with locked-in syndrome.

本文引用的文献

1
Identifying musical pieces from fMRI data using encoding and decoding models.基于编码和解码模型从 fMRI 数据中识别音乐片段。
Sci Rep. 2018 Feb 2;8(1):2266. doi: 10.1038/s41598-018-20732-3.
2
Multisubject "Learning" for Mental Workload Classification Using Concurrent EEG, fNIRS, and Physiological Measures.使用同步脑电图、功能近红外光谱和生理测量进行多主体“学习”以实现心理负荷分类
Front Hum Neurosci. 2017 Jul 27;11:389. doi: 10.3389/fnhum.2017.00389. eCollection 2017.
3
Mapping between fMRI responses to movies and their natural language annotations.
功能性电刺激增强脑机接口:在闭锁综合征患者中的潜在应用。
J Neuroeng Rehabil. 2023 Nov 18;20(1):157. doi: 10.1186/s12984-023-01272-y.
4
Decoding of the speech envelope from EEG using the VLAAI deep neural network.使用 VLAAI 深度神经网络对 EEG 进行语音包络解码。
Sci Rep. 2023 Jan 16;13(1):812. doi: 10.1038/s41598-022-27332-2.
5
Beyond the brain-computer interface: Decoding brain activity as a tool to understand neuronal mechanisms subtending cognition and behavior.超越脑机接口:将脑活动解码作为理解支撑认知与行为的神经元机制的工具
Front Neurosci. 2022 Sep 8;16:811736. doi: 10.3389/fnins.2022.811736. eCollection 2022.
6
Dataset of Speech Production in intracranial.Electroencephalography.颅内脑电图语音产生数据集。
Sci Data. 2022 Jul 22;9(1):434. doi: 10.1038/s41597-022-01542-9.
7
Neural correlates of cognitive control in women with a history of sexual violence suggest altered prefrontal cortical activity during cognitive processing.有性暴力史女性认知控制的神经关联表明,在认知加工过程中前额叶皮质活动发生了改变。
Womens Health (Lond). 2022 Jan-Dec;18:17455057221081326. doi: 10.1177/17455057221081326.
8
Projections and the Potential Societal Impact of the Future of Neurotechnologies.神经技术未来的预测及其潜在的社会影响。
Front Neurosci. 2021 Nov 15;15:658930. doi: 10.3389/fnins.2021.658930. eCollection 2021.
9
A low-cost, wearable, do-it-yourself functional near-infrared spectroscopy (DIY-fNIRS) headband.一种低成本、可穿戴的自制功能性近红外光谱(DIY-fNIRS)头带。
HardwareX. 2021 Oct;10. doi: 10.1016/j.ohx.2021.e00204. Epub 2021 May 20.
10
A New Statistical Approach for fNIRS Hyperscanning to Predict Brain Activity of Preschoolers' Using Teacher's.一种用于功能近红外光谱超扫描的新统计方法,以利用教师的情况预测学龄前儿童的大脑活动。
Front Hum Neurosci. 2021 May 7;15:622146. doi: 10.3389/fnhum.2021.622146. eCollection 2021.
电影 fMRI 响应与其自然语言注释之间的映射。
Neuroimage. 2018 Oct 15;180(Pt A):223-231. doi: 10.1016/j.neuroimage.2017.06.042. Epub 2017 Jun 23.
4
Measuring speaker-listener neural coupling with functional near infrared spectroscopy.运用功能近红外光谱技术测量说话者-倾听者的神经耦合。
Sci Rep. 2017 Feb 27;7:43293. doi: 10.1038/srep43293.
5
Automatic Speech Recognition from Neural Signals: A Focused Review.基于神经信号的自动语音识别:综述
Front Neurosci. 2016 Sep 27;10:429. doi: 10.3389/fnins.2016.00429. eCollection 2016.
6
Decoding of Covert Vowel Articulation Using Electroencephalography Cortical Currents.利用脑电图皮层电流对隐蔽元音发音进行解码
Front Neurosci. 2016 May 3;10:175. doi: 10.3389/fnins.2016.00175. eCollection 2016.
7
Word pair classification during imagined speech using direct brain recordings.使用直接脑记录对想象言语中的词对进行分类。
Sci Rep. 2016 May 11;6:25803. doi: 10.1038/srep25803.
8
Brain-to-text: decoding spoken phrases from phone representations in the brain.脑到文本:从大脑中的语音表征解码口语短语。
Front Neurosci. 2015 Jun 12;9:217. doi: 10.3389/fnins.2015.00217. eCollection 2015.
9
Wearable functional near infrared spectroscopy (fNIRS) and transcranial direct current stimulation (tDCS): expanding vistas for neurocognitive augmentation.可穿戴式功能性近红外光谱(fNIRS)和经颅直流电刺激(tDCS):神经认知增强的广阔前景。
Front Syst Neurosci. 2015 Mar 9;9:27. doi: 10.3389/fnsys.2015.00027. eCollection 2015.
10
Hybrid fNIRS-EEG based classification of auditory and visual perception processes.基于混合功能近红外光谱-脑电图的听觉和视觉感知过程分类
Front Neurosci. 2014 Nov 18;8:373. doi: 10.3389/fnins.2014.00373. eCollection 2014.