Chen Guijun, Zhang Xueying, Zhang Jing, Li Fenglian, Duan Shufei
College of Information and Computer, Taiyuan University of Technology, Taiyuan, China.
Front Neurorobot. 2022 Sep 30;16:995552. doi: 10.3389/fnbot.2022.995552. eCollection 2022.
Brain-computer interfaces (BCIs) can translate intentions directly into instructions, greatly improving the interaction experience for disabled people and for specific interactive applications. To improve the efficiency of BCIs, this study explores the feasibility of an audio-assisted visual BCI speller and a deep learning-based single-trial event-related potential (ERP) decoding strategy.
In this study, a two-stage BCI speller combining the motion-onset visual evoked potential (mVEP) and semantically congruent audio-evoked ERPs was designed to output target characters. In the first stage, different groups of characters were presented simultaneously at different locations of the visual field, and the stimuli were coded into mVEPs based on a new space-division multiple-access scheme. In the second stage, the target character was output based on the audio-assisted mVEP. Meanwhile, a spatial-temporal attention-based convolutional neural network (STA-CNN) was proposed to recognize single-trial ERP components. The CNN can learn two-dimensional features covering both the spatial information of the activated channels and the time dependence among ERP components. In addition, the STA mechanism can enhance discriminative event-related features by adaptively learning probability weights.
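To make the STA-CNN idea concrete, the following is a minimal sketch (not the authors' released code) of a spatial-temporal attention CNN for single-trial ERP classification. The layer sizes, the exact attention design, and the input shape (channels x time samples) are illustrative assumptions based only on the description above; only the general structure, learned probability weights over channels and time points feeding a 2-D convolutional classifier, follows the abstract.

```python
# Hypothetical STA-CNN sketch; all hyperparameters are assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class STACNN(nn.Module):
    def __init__(self, n_channels=32, n_samples=200, n_classes=2):
        super().__init__()
        # Learnable attention scores over EEG channels (spatial) and time
        # points (temporal); a softmax turns the scores into probability
        # weights, as the abstract describes.
        self.spatial_scores = nn.Parameter(torch.zeros(n_channels))
        self.temporal_scores = nn.Parameter(torch.zeros(n_samples))
        # 2-D convolutions over the (channel, time) plane so the network can
        # learn spatial patterns across channels and time dependence among
        # ERP components jointly.
        self.conv1 = nn.Conv2d(1, 8, kernel_size=(1, 25), padding=(0, 12))
        self.conv2 = nn.Conv2d(8, 16, kernel_size=(n_channels, 1))
        self.pool = nn.AvgPool2d((1, 4))
        self.fc = nn.Linear(16 * (n_samples // 4), n_classes)

    def forward(self, x):                             # x: (batch, channels, samples)
        w_s = F.softmax(self.spatial_scores, dim=0)   # channel weights, sum to 1
        w_t = F.softmax(self.temporal_scores, dim=0)  # time weights, sum to 1
        x = x * w_s[None, :, None] * w_t[None, None, :]
        x = x.unsqueeze(1)                            # (batch, 1, channels, samples)
        x = F.elu(self.conv1(x))                      # temporal filtering
        x = F.elu(self.conv2(x))                      # spatial filtering across channels
        x = self.pool(x)
        return self.fc(x.flatten(1))                  # class logits

# Usage on a dummy single-trial batch: 4 epochs, 32 channels, 200 samples.
if __name__ == "__main__":
    logits = STACNN()(torch.randn(4, 32, 200))
    print(logits.shape)  # torch.Size([4, 2])
```

Because the attention weights are explicit softmax distributions over channels and time points, they can be inspected after training, which is the kind of interpretability analysis the conclusion refers to.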
The performance of the proposed two-stage audio-assisted visual BCI paradigm and the STA-CNN model was evaluated using electroencephalogram (EEG) data recorded from 10 subjects. The average classification accuracy of the proposed STA-CNN reached 59.6% and 77.7% for the first and second stages, respectively, consistently and significantly higher than those of the comparison methods (p < 0.05).
The proposed two-stage audio-assisted visual paradigm showed great potential for use in a BCI speller. Moreover, analysis of the attention weights across the time sequence and spatial topographies showed that STA-CNN can effectively extract interpretable spatiotemporal EEG features.