• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多模态RSVP的目标检测的跨模态引导与重加权网络

Cross-modal guiding and reweighting network for multi-modal RSVP-based target detection.

作者信息

Mao Jiayu, Qiu Shuang, Wei Wei, He Huiguang

机构信息

Laboratory of Brain Atlas and Brain-Inspired Intelligence, State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences, Beijing, China; School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, China.

Laboratory of Brain Atlas and Brain-Inspired Intelligence, State Key Laboratory of Multimodal Artificial Intelligence Systems, Institute of Automation, Chinese Academy of Sciences, Beijing, China.

出版信息

Neural Netw. 2023 Apr;161:65-82. doi: 10.1016/j.neunet.2023.01.009. Epub 2023 Jan 16.

DOI:10.1016/j.neunet.2023.01.009
PMID:36736001
Abstract

Rapid Serial Visual Presentation (RSVP) based Brain-Computer Interface (BCI) facilities the high-throughput detection of rare target images by detecting evoked event-related potentials (ERPs). At present, the decoding accuracy of the RSVP-based BCI system limits its practical applications. This study introduces eye movements (gaze and pupil information), referred to as EYE modality, as another useful source of information to combine with EEG-based BCI and forms a novel target detection system to detect target images in RSVP tasks. We performed an RSVP experiment, recorded the EEG signals and eye movements simultaneously during a target detection task, and constructed a multi-modal dataset including 20 subjects. Also, we proposed a cross-modal guiding and fusion network to fully utilize EEG and EYE modalities and fuse them for better RSVP decoding performance. In this network, a two-branch backbone was built to extract features from these two modalities. A Cross-Modal Feature Guiding (CMFG) module was proposed to guide EYE modality features to complement the EEG modality for better feature extraction. A Multi-scale Multi-modal Reweighting (MMR) module was proposed to enhance the multi-modal features by exploring intra- and inter-modal interactions. And, a Dual Activation Fusion (DAF) was proposed to modulate the enhanced multi-modal features for effective fusion. Our proposed network achieved a balanced accuracy of 88.00% (±2.29) on the collected dataset. The ablation studies and visualizations revealed the effectiveness of the proposed modules. This work implies the effectiveness of introducing the EYE modality in RSVP tasks. And, our proposed network is a promising method for RSVP decoding and further improves the performance of RSVP-based target detection systems.

摘要

基于快速序列视觉呈现(RSVP)的脑机接口(BCI)通过检测诱发的事件相关电位(ERP)来实现对罕见目标图像的高通量检测。目前,基于RSVP的BCI系统的解码精度限制了其实际应用。本研究引入眼动(注视和瞳孔信息),称为EYE模态,作为另一个有用的信息源与基于脑电图的BCI相结合,并形成一种新颖的目标检测系统,以检测RSVP任务中的目标图像。我们进行了一项RSVP实验,在目标检测任务期间同时记录脑电图信号和眼动,并构建了一个包含20名受试者的多模态数据集。此外,我们提出了一种跨模态引导与融合网络,以充分利用脑电图和EYE模态,并将它们融合以获得更好的RSVP解码性能。在这个网络中,构建了一个双分支主干来从这两种模态中提取特征。提出了一种跨模态特征引导(CMFG)模块来引导EYE模态特征以补充脑电图模态,从而实现更好的特征提取。提出了一种多尺度多模态重加权(MMR)模块,通过探索模态内和模态间的相互作用来增强多模态特征。并且,提出了一种双激活融合(DAF)来调制增强后的多模态特征以进行有效融合。我们提出的网络在收集的数据集上实现了88.00%(±2.29)的平衡准确率。消融研究和可视化结果揭示了所提出模块的有效性。这项工作表明在RSVP任务中引入EYE模态的有效性。并且,我们提出的网络是一种有前途的RSVP解码方法,进一步提高了基于RSVP的目标检测系统的性能。

相似文献

1
Cross-modal guiding and reweighting network for multi-modal RSVP-based target detection.基于多模态RSVP的目标检测的跨模态引导与重加权网络
Neural Netw. 2023 Apr;161:65-82. doi: 10.1016/j.neunet.2023.01.009. Epub 2023 Jan 16.
2
Attention-based convolutional neural network with multi-modal temporal information fusion for motor imagery EEG decoding.基于注意力的卷积神经网络与多模态时间信息融合在运动想象 EEG 解码中的应用。
Comput Biol Med. 2024 Jun;175:108504. doi: 10.1016/j.compbiomed.2024.108504. Epub 2024 Apr 24.
3
Multi-source domain adaptation based tempo-spatial convolution network for cross-subject EEG classification in RSVP task.基于多源域自适应的时空卷积网络用于RSVP任务中的跨主体脑电分类
J Neural Eng. 2024 Feb 16;21(1). doi: 10.1088/1741-2552/ad2710.
4
SSVEP-assisted RSVP brain-computer interface paradigm for multi-target classification.基于 SSVEP 辅助的 RSVP 脑-机接口范式的多目标分类。
J Neural Eng. 2021 Feb 23;18(1). doi: 10.1088/1741-2552/abd1c0.
5
LDER: a classification framework based on ERP enhancement in RSVP task.LDER:基于 RSVP 任务中 ERP 增强的分类框架。
J Neural Eng. 2023 Jun 8;20(3). doi: 10.1088/1741-2552/acd95d.
6
A Cross-Session Dataset for Collaborative Brain-Computer Interfaces Based on Rapid Serial Visual Presentation.一个基于快速序列视觉呈现的用于协作式脑机接口的跨会话数据集。
Front Neurosci. 2020 Oct 22;14:579469. doi: 10.3389/fnins.2020.579469. eCollection 2020.
7
Enhancing the EEG classification in RSVP task by combining interval model of ERPs with spatial and temporal regions of interest.通过将 ERP 的区间模型与时空兴趣区域相结合,提高 RSVP 任务中的 EEG 分类。
J Neural Eng. 2021 Feb 5;18(1). doi: 10.1088/1741-2552/abc8d5.
8
ERP prototypical matching net: a meta-learning method for zero-calibration RSVP-based image retrieval.ERP 原型匹配网络:一种基于零校准 RSVP 的元学习图像检索方法。
J Neural Eng. 2022 Apr 4;19(2). doi: 10.1088/1741-2552/ac5eb7.
9
Single-Trial EEG Classification Using Spatio-Temporal Weighting and Correlation Analysis for RSVP-Based Collaborative Brain Computer Interface.基于 RSVP 的协作脑机接口的时空加权和相关分析的单次脑电分类
IEEE Trans Biomed Eng. 2024 Feb;71(2):553-562. doi: 10.1109/TBME.2023.3309255. Epub 2024 Jan 19.
10
Reducing Calibration Efforts in RSVP Tasks With Multi-Source Adversarial Domain Adaptation.多源对抗域自适应在 RSVP 任务中减少校准工作。
IEEE Trans Neural Syst Rehabil Eng. 2020 Nov;28(11):2344-2355. doi: 10.1109/TNSRE.2020.3023761. Epub 2020 Nov 6.

引用本文的文献

1
A MultiModal Vigilance (MMV) dataset during RSVP and SSVEP brain-computer interface tasks.RSVP 和 SSVEP 脑-机接口任务中的多模态警觉(MMV)数据集。
Sci Data. 2024 Aug 10;11(1):867. doi: 10.1038/s41597-024-03729-8.
2
Time-band network model and binary tree algorithm for multimodal irregular flight recovery.用于多模态不规则航班恢复的时间带网络模型和二叉树算法。
Sci Rep. 2024 Mar 4;14(1):5242. doi: 10.1038/s41598-024-56000-w.
3
[Research progress of brain-computer interface application paradigms based on rapid serial visual presentation].
基于快速序列视觉呈现的脑机接口应用范式研究进展
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2023 Dec 25;40(6):1235-1241. doi: 10.7507/1001-5515.202305061.
4
A novel feature fusion network for multimodal emotion recognition from EEG and eye movement signals.一种用于从脑电图和眼动信号中进行多模态情感识别的新型特征融合网络。
Front Neurosci. 2023 Aug 3;17:1234162. doi: 10.3389/fnins.2023.1234162. eCollection 2023.