用于多模态自然语言处理的脑电图脑活动解码

Decoding EEG Brain Activity for Multi-Modal Natural Language Processing.

作者信息

Hollenstein Nora, Renggli Cedric, Glaus Benjamin, Barrett Maria, Troendle Marius, Langer Nicolas, Zhang Ce

机构信息

Department of Nordic Studies and Linguistics, University of Copenhagen, Copenhagen, Denmark.

Department of Computer Science, Swiss Federal Institute of Technology, ETH Zurich, Zurich, Switzerland.

出版信息

Front Hum Neurosci. 2021 Jul 13;15:659410. doi: 10.3389/fnhum.2021.659410. eCollection 2021.

DOI:10.3389/fnhum.2021.659410

PMID:34326723

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8314009/

Abstract

Until recently, human behavioral data from reading has mainly been of interest to researchers to understand human cognition. However, these human language processing signals can also be beneficial in machine learning-based natural language processing tasks. Using EEG brain activity for this purpose is largely unexplored as of yet. In this paper, we present the first large-scale study of systematically analyzing the potential of EEG brain activity data for improving natural language processing tasks, with a special focus on which features of the signal are most beneficial. We present a multi-modal machine learning architecture that learns jointly from textual input as well as from EEG features. We find that filtering the EEG signals into frequency bands is more beneficial than using the broadband signal. Moreover, for a range of word embedding types, EEG data improves binary and ternary sentiment classification and outperforms multiple baselines. For more complex tasks such as relation detection, only the contextualized BERT embeddings outperform the baselines in our experiments, which raises the need for further research. Finally, EEG data shows to be particularly promising when limited training data is available.

摘要

直到最近，阅读中的人类行为数据主要引起研究人员的兴趣，用于理解人类认知。然而，这些人类语言处理信号在基于机器学习的自然语言处理任务中也可能有益。目前，在这方面使用脑电图（EEG）脑活动数据在很大程度上尚未得到探索。在本文中，我们展示了第一项大规模研究，系统地分析了EEG脑活动数据在改善自然语言处理任务方面的潜力，特别关注信号的哪些特征最有益。我们提出了一种多模态机器学习架构，它可以从文本输入以及EEG特征中联合学习。我们发现，将EEG信号过滤到不同频段比使用宽带信号更有益。此外，对于一系列词嵌入类型，EEG数据改善了二元和三元情感分类，并优于多个基线。对于关系检测等更复杂的任务，在我们的实验中，只有情境化的BERT嵌入优于基线，这就需要进一步研究。最后，当可用训练数据有限时，EEG数据显示出特别有前景。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bdba/8314009/aed263a2b4cf/fnhum-15-659410-g0001.jpg

相似文献

Decoding EEG Brain Activity for Multi-Modal Natural Language Processing.

Front Hum Neurosci. 2021 Jul 13;15:659410. doi: 10.3389/fnhum.2021.659410. eCollection 2021.

BELT: Bootstrapped EEG-to-Language Training by Natural Language Supervision.

IEEE Trans Neural Syst Rehabil Eng. 2024;32:3278-3288. doi: 10.1109/TNSRE.2024.3450795. Epub 2024 Sep 11.

A novel multi-modal machine learning based approach for automatic classification of EEG recordings in dementia.

Neural Netw. 2020 Mar;123:176-190. doi: 10.1016/j.neunet.2019.12.006. Epub 2019 Dec 14.

Transfer Learning for Sentiment Analysis Using BERT Based Supervised Fine-Tuning.

Sensors (Basel). 2022 May 30;22(11):4157. doi: 10.3390/s22114157.

Language with vision: A study on grounded word and sentence embeddings.

Behav Res Methods. 2024 Sep;56(6):5622-5646. doi: 10.3758/s13428-023-02294-z. Epub 2023 Dec 19.

ConTraNet: A hybrid network for improving the classification of EEG and EMG signals with limited training data.

Comput Biol Med. 2024 Jan;168:107649. doi: 10.1016/j.compbiomed.2023.107649. Epub 2023 Nov 2.

Effective Transfer Learning with Label-Based Discriminative Feature Learning.

Sensors (Basel). 2022 Mar 4;22(5):2025. doi: 10.3390/s22052025.

Improved biomedical word embeddings in the transformer era.

J Biomed Inform. 2021 Aug;120:103867. doi: 10.1016/j.jbi.2021.103867. Epub 2021 Jul 18.

Explainable hybrid word representations for sentiment analysis of financial news.

Neural Netw. 2023 Jul;164:115-123. doi: 10.1016/j.neunet.2023.04.011. Epub 2023 Apr 21.

Multiscale space-time-frequency feature-guided multitask learning CNN for motor imagery EEG classification.

J Neural Eng. 2021 Feb 24;18(2). doi: 10.1088/1741-2552/abd82b.

引用本文的文献

A simultaneous EEG and eye-tracking dataset for remote sensing object detection.

Sci Data. 2025 Apr 17;12(1):651. doi: 10.1038/s41597-025-04995-w.

Reconstructing signal during brain stimulation with Stim-BERT: a self-supervised learning model trained on millions of iEEG files.

Front Artif Intell. 2025 Feb 18;8:1502504. doi: 10.3389/frai.2025.1502504. eCollection 2025.

Multi-branch convolutional neural network with cross-attention mechanism for emotion recognition.

Sci Rep. 2025 Feb 1;15(1):3976. doi: 10.1038/s41598-025-88248-1.

Brain-model neural similarity reveals abstractive summarization performance.

Sci Rep. 2025 Jan 2;15(1):370. doi: 10.1038/s41598-024-84530-w.

DERCo: A Dataset for Human Behaviour in Reading Comprehension Using EEG.

Sci Data. 2024 Oct 9;11(1):1104. doi: 10.1038/s41597-024-03915-8.

Lobish: Symbolic Language for Interpreting Electroencephalogram Signals in Language Detection Using Channel-Based Transformation and Pattern.

Diagnostics (Basel). 2024 Sep 8;14(17):1987. doi: 10.3390/diagnostics14171987.

Large-scale foundation models and generative AI for BigData neuroscience.

Neurosci Res. 2024 Jun 17. doi: 10.1016/j.neures.2024.06.003.

ChineseEEG: A Chinese Linguistic Corpora EEG Dataset for Semantic Alignment and Neural Decoding.

Sci Data. 2024 May 29;11(1):550. doi: 10.1038/s41597-024-03398-7.

Detection of Language Lateralization Using Spectral Analysis of EEG.

J Clin Neurophysiol. 2024 May 1;41(4):334-343. doi: 10.1097/WNP.0000000000000988.

The ZuCo benchmark on cross-subject reading task classification with EEG and eye-tracking data.

Front Psychol. 2023 Jan 12;13:1028824. doi: 10.3389/fpsyg.2022.1028824. eCollection 2022.

本文引用的文献

Traces of Meaning Itself: Encoding Distributional Word Vectors in Brain Activity.

Neurobiol Lang (Camb). 2020 Mar 1;1(1):54-76. doi: 10.1162/nol_a_00003. eCollection 2020.

Brain2Char: a deep architecture for decoding text from brain recordings.

J Neural Eng. 2020 Dec 16;17(6). doi: 10.1088/1741-2552/abc742.

THE OF NEURAL OSCILLATIONS TO INFORM SENTENCE COMPREHENSION: A LINGUISTIC PERSPECTIVE.

Lang Linguist Compass. 2019 Sep;13(9). doi: 10.1111/lnc3.12347. Epub 2019 Aug 14.

Placing language in an integrated understanding system: Next steps toward human-level performance in neural language models.

Proc Natl Acad Sci U S A. 2020 Oct 20;117(42):25966-25974. doi: 10.1073/pnas.1910416117. Epub 2020 Sep 28.

Neural dynamics of sentiment processing during naturalistic sentence reading.

Neuroimage. 2020 Sep;218:116934. doi: 10.1016/j.neuroimage.2020.116934. Epub 2020 May 13.

Multimodal Transformer for Unaligned Multimodal Language Sequences.

Proc Conf Assoc Comput Linguist Meet. 2019 Jul;2019:6558-6569. doi: 10.18653/v1/p19-1656.

Language and motor processing in reading and typing: Insights from beta-frequency band power modulations.

Brain Lang. 2020 May;204:104758. doi: 10.1016/j.bandl.2020.104758. Epub 2020 Feb 5.

Neural theta oscillations support semantic memory retrieval.

Sci Rep. 2019 Nov 27;9(1):17667. doi: 10.1038/s41598-019-53813-y.

Unfold: an integrated toolbox for overlap correction, non-linear modeling, and regression-based EEG analysis.

PeerJ. 2019 Oct 24;7:e7838. doi: 10.7717/peerj.7838. eCollection 2019.

How are visual words represented? Insights from EEG-based visual word decoding, feature derivation and image reconstruction.

Hum Brain Mapp. 2019 Dec 1;40(17):5056-5068. doi: 10.1002/hbm.24757. Epub 2019 Aug 12.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于多模态自然语言处理的脑电图脑活动解码

Decoding EEG Brain Activity for Multi-Modal Natural Language Processing.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献